Gene Rsph17029_2031 details


Gene Information

Locus tag: Rsph17029_2031
Symbol:
ID: 4897743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2152400
End bp: 2153755
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 66%
IMG OID: 640112624
Product: glutamate--ammonia ligase
Protein accession: YP_001043906
Protein GI: 126462792
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0174] Glutamine synthetase
TIGRFAM ID:
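
The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive coordinate span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the last codon (stop) encodes no residue
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
print(cds_lengths(2152400, 2153755))      # (1356, 451)
```

The result agrees with the listed Gene Length (1356 bp) and Protein Length (451 aa).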


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.794004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGACT GGACCGACAG ACTTCCCGAA GCCGCCCGCG CCTACATCGC AGACCGCCGG 
GTGGACGAAG TGGAGTGCAT CCTCTCCGAC ATCGCGGGCG TGGCGCGCGG CAAGGCCATG
CCTGCCTTCA AGTTCGGCAA GCAGTCGAGC TTCTTCCTGC CGAACTCGAT CTTCCTGCAG
ACCATCACCG GCGAATGGGC CGACAATCCC TCGGGCGCCT TCACCGAGCC CGACATGATC
CTGATCCCGG ACTATTCCAC CGCGACCGCC GCGCCCTGGA CGGCGGATAT CACCTTGCAG
GTGATCCACG ATGCGGTGGA CCAGCAGGGC CGGCCGGTGC CCGTCTCGCC GCGCAACGTG
CTGCGGCGGG TGGTCGAGCT TTACAATGCG GAAGGCTGGA CGCCGATCGT GGCGCCGGAG
ATGGAGTTCT TCCTCGTCGC GCGCAACATC GACCCCAACA TGCCGGTCAT GCCGCCCATG
GGCCGGACGG GCCGCCGTGC GGCGGCCAAG CAGGCCTATT CCATGTCCGC GGTGGACGAA
TACGGCAAGG TGATCGACGA CATCTACGAC TTCGCCGAGG CGCAGGGTTT CGAGATCGAC
GGGATCCTGC AGGAGGGCGG CGCGGGTCAG GTCGAGATCA ACCTCGCTCA TGGCGACCCG
GTGGCTCTGG CCGACCAGAT CTTCTTCTTC AAGCGGCTGA TCCGCGAGGC CGCGCTGCGC
CACGACTGTT TCGCGACCTT CATGGCCAAG CCCATCGAGG GCGAGCCGGG CTCGGCCATG
CACATCCACC ATTCGGTCGT CGACAGCGCG AGCAAGCTCA ACATCTTCTC GGATGCCAAG
GGCGGCGAAA CCGAGGCCTT CCTCCATTTC ATCGCGGGCA TGCAGACGCA CCTGCCCGCG
GCGGTCGCAC TGCTTGCGCC CTACGTCAAC AGCTACCGCC GCTACGTCCC GGACTTCGCG
GCCCCGATCA ACCTCGAATG GGGACGCGAC AACCGAACGA CAGGGCTGCG CGTGCCGATC
TCGGGGCCCG AGGCGCGGCG GCTCGAGAAC CGGCTGGCCG GGATGGACTG CAACCCCTAC
CTCGGGCTCG CGGCGTCGCT CGCCTGCGGC TATCTGGGGC TGAAGGAGCG GAAGATGCCG
CAGCCCGAAT GCACGGGCGA CGCCTACATG TCCGAGACGG ATCTGCCCTA CAACCTCGGC
GATGCGCTCG ACCTGCTCGA GGAGGACGCG GCCCTGCGCG ACGTGCTGGG GCCCGAGTTC
TGCGGCGTCT ACGATTCGGT CAAGCGCAAC GAATACAAGG AGTTCCTGCA GGTCATCAGC
CCGTGGGAGC GCGAGCATCT GCTGCTGAAC GTATGA
 
Protein sequence
MSDWTDRLPE AARAYIADRR VDEVECILSD IAGVARGKAM PAFKFGKQSS FFLPNSIFLQ 
TITGEWADNP SGAFTEPDMI LIPDYSTATA APWTADITLQ VIHDAVDQQG RPVPVSPRNV
LRRVVELYNA EGWTPIVAPE MEFFLVARNI DPNMPVMPPM GRTGRRAAAK QAYSMSAVDE
YGKVIDDIYD FAEAQGFEID GILQEGGAGQ VEINLAHGDP VALADQIFFF KRLIREAALR
HDCFATFMAK PIEGEPGSAM HIHHSVVDSA SKLNIFSDAK GGETEAFLHF IAGMQTHLPA
AVALLAPYVN SYRRYVPDFA APINLEWGRD NRTTGLRVPI SGPEARRLEN RLAGMDCNPY
LGLAASLACG YLGLKERKMP QPECTGDAYM SETDLPYNLG DALDLLEEDA ALRDVLGPEF
CGVYDSVKRN EYKEFLQVIS PWEREHLLLN V
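
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a minimal Python sketch. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the sequence is assumed to be pasted in the 10-base blocks shown above, and the codon-to-amino-acid assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs mainly in which start codons it permits).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):     # a trailing partial codon is ignored
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence above.
print(translate("ATGTCCGACT GGACCGACAG"))   # MSDWTD
```

The printed residues match the start of the protein sequence (MSDWTD...). Passing the full gene sequence to `gc_fraction` would give the value to compare against the listed GC content.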