Gene TM1040_1934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1934 
Symbol 
ID4076885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2036243 
End bp2037481 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content60% 
IMG OID638007250 
Productaspartate kinase 
Protein accessionYP_613929 
Protein GI99081775 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.167611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTAC TTGTGATGAA ATTCGGCGGC ACATCCGTCG CCAATCTGGA CCGCATTCGC 
CGCGCTGCCA AACGCGTTGG TGTCGAAGTG GCCAAAGGCT ATGACGTGAT CGTCATTGTC
TCCGCCATGT CCGGCAAGAC CAACGAGCTG GTCGGTTGGG TGGGGGAGAC CTCGCCGCTC
TATGATGCGC GTGAATATGA TGCGGTTGTA TCCTCTGGTG AGAATGTGAC CGCGGGCCTC
ATGGCGTTGA CGCTGCAAGA GATGGACGTG CCCGCGCGCA GCTGGCAGGG CTGGCAAGTG
CCGCTCAAGA CCAACTCGGC CCACAGCCAG GCCCGGATCG AAGAGATCGG CACAGAGAAC
ATCAACCAGA AGTTCGGCGA AGGCATGAAA GTGGCCGTTG TTGCGGGCTT TCAGGGGATT
TCTCCCGAAG GTCGCATCAC CACCCTCGGG CGCGGCGGCT CTGACACCAC AGCGGTGGCT
TTTGCGGCGG CCTTCGGGGC GGAGCGCTGC GATATCTACA CCGATGTGGA CGGCGTCTAT
ACCACCGACC CGCGCATCTG CGAAAAGGCA CGCAAGCTCG ACAAGATCGC CTTTGAGGAA
ATGCTGGAGC TGGCATCCTT GGGCGCCAAG GTGCTGCAAA CCCGCTCCGT CGAGCTGGCG
ATGCGCTACA AGGTGAAACT GCGCGTGCTC TCGAGCTTTG AAGAACAGTC CGACGAGGCC
GGAACCCTGG TCTGCGACGA GGAGGAAATC ATGGAATCCA ATGTTGTTAA CGGCGTTGCC
TACTCGCGGG ATGAGGCCAA ACTGACCTGT CTTTCGGTCG CGGACCGTCC GGGCATCGCG
GCGACCATTT TTGGCTGCCT CTCGGATGCC GGCGTCAACG TCGATATGAT CGTGCAGAAC
ATCTCTGAAG ATGGGCGCAC GGATATGACG TTCTCTTGCC CCACGGATCA GGTACAGCGC
GCGGAAATGG CCCTGAACGC CTACAAAGAG AAGGGCGAGC TGAACTTTGC TGAACTCGTG
GCGGACACCG GTGTTGCGAA GATTTCGGTG GTGGGCATCG GCATGCGATC GCAGTCCGGT
GTGGCCGCCA AGATGTTCAA GGTCCTCTCG GATGAGGGCA TCAACATCAA GGTGATCACC
ACCTCCGAGA TCAAGATTTC GGTGCTGGTG GACCGCAAAT ACATGGAGCT CGCCGTGCAG
GCCCTGCACG ACGCCTTTGA GCTCGACAAA GCCAGCTGA
 
Protein sequence
MPLLVMKFGG TSVANLDRIR RAAKRVGVEV AKGYDVIVIV SAMSGKTNEL VGWVGETSPL 
YDAREYDAVV SSGENVTAGL MALTLQEMDV PARSWQGWQV PLKTNSAHSQ ARIEEIGTEN
INQKFGEGMK VAVVAGFQGI SPEGRITTLG RGGSDTTAVA FAAAFGAERC DIYTDVDGVY
TTDPRICEKA RKLDKIAFEE MLELASLGAK VLQTRSVELA MRYKVKLRVL SSFEEQSDEA
GTLVCDEEEI MESNVVNGVA YSRDEAKLTC LSVADRPGIA ATIFGCLSDA GVNVDMIVQN
ISEDGRTDMT FSCPTDQVQR AEMALNAYKE KGELNFAELV ADTGVAKISV VGIGMRSQSG
VAAKMFKVLS DEGINIKVIT TSEIKISVLV DRKYMELAVQ ALHDAFELDK AS