Gene TM1040_3720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3720 
Symbol 
ID4075427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp779668 
End bp780957 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content61% 
IMG OID638005240 
Productglutamate--ammonia ligase 
Protein accessionYP_611949 
Protein GI99078691 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0174] Glutamine synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.698416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.273252 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACTC GCCTGCGCGC GTTATTCTGC GACCATCTCA GTATCATGCG CGGGAAATAC 
CTGCCGCATT CAAAGATCGG CGACGATGAA ACCCGGTTCT GCCGCTCTGT GTTTGGCACC
CATTATGACC GCGACCTGCT GGACGCGCCG GGGTCGATGG TCAAACAGGG CCTACCAGAC
ATGACGCTGC GCTGGCGTCA CGATGACATT CGCGACAGTT GGCACGCCTC GACCAAAATC
GTCTTGGGCG ATCTTTATGA CGACGAGGGC GAGCTGCTGA CGCTTTGTCC CAGAGGGGCG
TTGAAACGCG CCGTTGCGGA TTGGCAGGGA AAGGGGCTTT CTCCAAAAAT TGGTATCGAA
CTTGAGGCTT TTGCCCTGCA GCCCGACGAA TACGGGCGGC TTGTCCCCTA TGATGCGCCC
GGCGGAGTGG TCTACGGCAC CGGGCCGTTT GCAGATCCAT TACGGTTCAA TGACCGGATC
TGGGCGATGG CGGATGAGAT GGGCTTCTCT CTCGACATGA TTACGGCGGA GTTCGACAGC
CCTCAGTTTG AATATACGCT GACCTTTGAC GACGCTGTAA AGGCGGTGGA TGACATCGTG
CTGTTTCGCT TGATGGCGCG TGAGATCGCG CTGGAGTACG GGATCGTTCT GACGTTCATG
CCCAAGCCGG TCGCGCAGGC AGGGGGGTCA GGCATGCATG TGAACCTCTC GTTCACGGAT
GAGGCGGGGG GAAATGCGCT TTCGTCGGGG CCTCGGGGCG GGCCGGATCA CATGAATGAT
CTCGCGCGCG GCTGCCTTGC CGGGTTTCTG CATCATCACA AGGGCTTGGC CGGTCTGATC
GCGCCCACCG CCAACAGCTA CATGCGTCTG CAACCGGGGA GTCTGTCGGG CTTTTGGCAG
AACTGGGGCG GCGATCATCG CAATGTCACC ACTCGGATCA GCTCCGAAGG CGGGGCGAAG
GCGCGGCTTG AACACCGAAT GGCGGATGCC TCCTCCAATC CCTATACCAC GGTGGCGGCG
CTCTTGCAGG CGGCGCGCCT TGGCGTGGAG CGCGGCTATG CGCTGGGACC GATGGAAACC
GGCGATGGGT TTGACCGCAC GGACACGCGC GAAAGCACCG CAATGACGCT CAAGGGCGCG
GTCGCAGATC TGGAAAAGGA TACCTCCCTT GCGGAGGCGG TGGGGCCGGA TCTGGTCGCC
AATCATGTCT ACATGAAGCA GAAAGAGGTC CGCAAAACCC GCGACCTCGA AGGCGATGCG
CTGCGGGACT TCTACGTGCA TTTTGTCTGA
 
Protein sequence
MKTRLRALFC DHLSIMRGKY LPHSKIGDDE TRFCRSVFGT HYDRDLLDAP GSMVKQGLPD 
MTLRWRHDDI RDSWHASTKI VLGDLYDDEG ELLTLCPRGA LKRAVADWQG KGLSPKIGIE
LEAFALQPDE YGRLVPYDAP GGVVYGTGPF ADPLRFNDRI WAMADEMGFS LDMITAEFDS
PQFEYTLTFD DAVKAVDDIV LFRLMAREIA LEYGIVLTFM PKPVAQAGGS GMHVNLSFTD
EAGGNALSSG PRGGPDHMND LARGCLAGFL HHHKGLAGLI APTANSYMRL QPGSLSGFWQ
NWGGDHRNVT TRISSEGGAK ARLEHRMADA SSNPYTTVAA LLQAARLGVE RGYALGPMET
GDGFDRTDTR ESTAMTLKGA VADLEKDTSL AEAVGPDLVA NHVYMKQKEV RKTRDLEGDA
LRDFYVHFV