Gene Maqu_0455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMaqu_0455 
Symbol 
ID4657424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinobacter aquaeolei VT8 
KingdomBacteria 
Replicon accessionNC_008740 
Strand
Start bp518918 
End bp520207 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content62% 
IMG OID639810407 
Productproline dipeptidase 
Protein accessionYP_957743 
Protein GI120553392 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAATC AAGACATCCT CGACCTGCAG CCTGCCCACA TCCGTGAGCT GCAGAACCGG 
TACGAACGGG CCATGGCTGA TCACGGTTAT GACAGCCTGC TGATTGCCTC GGGCGCGGCG
CCCTACCGGT ATCGGGATGA TCAGGCCTAT GTATTTCAGG GCTTCGGCCC GTTTTTGCAC
TGGACCGGGC TGGCTGGCCA GGAGCACAGC TGGCTGTTGG TTCGGCCCGG CCAGAAACCG
GTGCTCTGGC TATACCAGCC GGTGGACTTC TGGCACGCCA ACCCGGTGAT GGCTGAAGAA
CCCTGGCTAG AGGTTGTAGA CGTGCGCAGC CGGCAGCAAT CTGGCGCCCC GGAGCTCGAT
AGCCCGGGGC GGCTGGTGGT AATCGGTGAT CCGCTGGTGC TGGATGGCGT GCCGGGAGAT
CATAACCCGG CGGCACTGGT GGCAGCCGTG GAAGAAGCCC GCGTGCGCAA AACTCCTTAC
GAAATTGAAT GCCTGGCACA CGCTAACCGG ATCGCCCTGG CTGGTCACAG GGCGGCCAGA
GAGGCGTTTC TGGCCGGCGA CAGCGAATTC GGTATCAATC TCGCCTACCA GCAGGCCACC
GGCCAGCGGG AAGTGGAGGC GCCTTATCAC AGCATCATCG GATTGAACGA ACATGCCGGC
ACACTGCACT ACCAGTACTA CGACCTGAAG CCTCCCCGCC GGCCCCGCAG TCTGCTGATT
GATGCCGGGG TGCGCTACCG TGGCTACTGT TCAGATATAA CCCGCACCAC CGCCGGGCCG
GATGAGCCGA GATTCACCGC GCTGGTTCAC GGGCTGGAGA AACTGCAGCT GCGGCTATGC
GAGATGGTGT CGCCCGGCGT GGACTACGTG GACATTCACC GCAAGGCTCA CCTGGGCATT
GCGACCCTGC TATCAGCTTC GGGGCTGGTG TCCGGTTTAG CGGATGAAGC CATGGTGGAG
CAGGGCATTA CCCGGGCGTT CTTTCCCCAT GGCATAGGGC ATTTTCTGGG CATTCAGGTG
CACGATGTCG CCGGCAAACC GACGCCGTCC CCGGAAGATG CGCCGTTCCT GCGCCTGACC
CGCACCCTGG AAGCCGGCAT GGTGGTCACC ATCGAGCCGG GGCTCTATTT TATTCCATCG
CTGCTGGAGC CTTTGCTGAA CGGGCCTGAA GCCCAGTATC TGAACCGGGC GCTGATTGAT
GAACTGAAGA GCTGTGGTGG AATCCGGATT GAAGATAATG TGGTGGTCAC AGCGGCCGGG
GCCCGGAACT TAACGCGAGA ATGCGAATAG
 
Protein sequence
MPNQDILDLQ PAHIRELQNR YERAMADHGY DSLLIASGAA PYRYRDDQAY VFQGFGPFLH 
WTGLAGQEHS WLLVRPGQKP VLWLYQPVDF WHANPVMAEE PWLEVVDVRS RQQSGAPELD
SPGRLVVIGD PLVLDGVPGD HNPAALVAAV EEARVRKTPY EIECLAHANR IALAGHRAAR
EAFLAGDSEF GINLAYQQAT GQREVEAPYH SIIGLNEHAG TLHYQYYDLK PPRRPRSLLI
DAGVRYRGYC SDITRTTAGP DEPRFTALVH GLEKLQLRLC EMVSPGVDYV DIHRKAHLGI
ATLLSASGLV SGLADEAMVE QGITRAFFPH GIGHFLGIQV HDVAGKPTPS PEDAPFLRLT
RTLEAGMVVT IEPGLYFIPS LLEPLLNGPE AQYLNRALID ELKSCGGIRI EDNVVVTAAG
ARNLTRECE