Gene EcolC_4146 details


Gene Information

Locus tag: EcolC_4146
Symbol: glnA
ID: 6066327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4574034
End bp: 4575443
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 53%
IMG OID: 641603567
Product: glutamine synthetase
Protein accession: YP_001727070
Protein GI: 170022116
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0174] Glutamine synthetase
TIGRFAM ID: [TIGR00653] glutamine synthetase, type I
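
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the terminal stop codon. The record does not state either convention, but both are consistent with the values shown. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4574034
END_BP = 4575443
GENE_LENGTH_BP = 1410
PROTEIN_LENGTH_AA = 469


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print(computed_length == GENE_LENGTH_BP)       # coordinates agree with Gene Length
print(computed_protein == PROTEIN_LENGTH_AA)   # Gene Length agrees with Protein Length
```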


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.216906
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCTG AACACGTACT GACGATGCTG AACGAGCACG AAGTGAAGTT TGTTGATTTG 
CGCTTCACCG ATACTAAAGG TAAAGAACAG CACGTCACTA TCCCTGCTCA TCAGGTGAAT
GCTGAATTCT TCGAAGAAGG CAAAATGTTT GACGGCTCCT CGATTGGCGG CTGGAAAGGC
ATTAACGAGT CCGACATGGT GCTGATGCCA GACGCATCCA CCGCAGTGAT TGACCCGTTC
TTCGCCGACT CCACCCTGAT TATCCGTTGC GACATCCTTG AACCTGGCAC CCTGCAAGGC
TATGACCGTG ACCCGCGCTC CATTGCGAAG CGCGCCGAAG ATTACCTGCG TTCCACTGGC
ATTGCCGACA CCGTACTGTT CGGGCCAGAA CCTGAATTCT TCCTGTTCGA TGACATCCGT
TTCGGATCAT CTATCTCCGG TTCTCACGTT GCTATCGACG ATATCGAAGG CGCATGGAAC
TCCTCCACCC AATATGAAGG TGGTAACAAA GGTCACCGTC CGGCAGTGAA AGGCGGTTAC
TTCCCGGTTC CGCCGGTAGA CTCTGCTCAG GATATTCGTT CTGAAATGTG TCTGGTGATG
GAACAGATGG GCCTGGTGGT TGAAGCCCAT CACCACGAAG TAGCGACTGC TGGTCAGAAC
GAAGTGGCTA CCCGCTTCAA TACCATGACC AAAAAAGCTG ACGAAATTCA GATCTACAAA
TATGTTGTGC ACAACGTAGC GCACCGCTTC GGTAAAACCG CGACCTTTAT GCCAAAACCG
ATGTTCGGTG ATAACGGCTC CGGTATGCAC TGCCACATGT CTCTGTCTAA AAACGGCGTT
AACCTGTTCG CAGGCGACAA ATACGCAGGT CTGTCTGAGC AGGCGCTGTA CTACATTGGC
GGCGTAATCA AACACGCTAA AGCGATTAAC GCCCTGGCAA ACCCGACCAC CAACTCTTAT
AAGCGTCTGG TCCCGGGCTA TGAAGCACCG GTAATGCTGG CTTACTCTGC GCGTAACCGT
TCTGCGTCTA TCCGTATTCC GGTGGTTTCT TCTCCGAAAG CACGTCGTAT CGAAGTACGT
TTCCCGGATC CGGCAGCTAA CCCGTACCTG TGCTTTGCTG CCCTGCTGAT GGCCGGTCTT
GATGGTATCA AGAACAAGAT CCATCCGGGC GAAGCCATGG ACAAAAACCT GTATGACCTG
CCGCCAGAAG AAGCGAAAGA GATCCCACAG GTTGCAGGCT CTCTGGAAGA AGCACTGAAC
GAACTGGATC TGGACCGCGA GTTCCTGAAA GCCGGTGGCG TGTTCACTGA CGAAGCAATT
GATGCGTACA TCGCACTGCG TCGCGAAGAA GATGACCGCG TGCGTATGAC TCCGCATCCG
GTAGAGTTTG AGCTGTACTA CAGCGTCTAA
 
Protein sequence
MSAEHVLTML NEHEVKFVDL RFTDTKGKEQ HVTIPAHQVN AEFFEEGKMF DGSSIGGWKG 
INESDMVLMP DASTAVIDPF FADSTLIIRC DILEPGTLQG YDRDPRSIAK RAEDYLRSTG
IADTVLFGPE PEFFLFDDIR FGSSISGSHV AIDDIEGAWN SSTQYEGGNK GHRPAVKGGY
FPVPPVDSAQ DIRSEMCLVM EQMGLVVEAH HHEVATAGQN EVATRFNTMT KKADEIQIYK
YVVHNVAHRF GKTATFMPKP MFGDNGSGMH CHMSLSKNGV NLFAGDKYAG LSEQALYYIG
GVIKHAKAIN ALANPTTNSY KRLVPGYEAP VMLAYSARNR SASIRIPVVS SPKARRIEVR
FPDPAANPYL CFAALLMAGL DGIKNKIHPG EAMDKNLYDL PPEEAKEIPQ VAGSLEEALN
ELDLDREFLK AGGVFTDEAI DAYIALRREE DDRVRMTPHP VEFELYYSV
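
The sequence listings above are printed in space-separated blocks of 10 characters, so they must be joined before any analysis. The following is a minimal sketch of how that could be done, together with a basic GC-fraction helper and a start/stop codon check. The helper names are hypothetical, and the stop codon set is the standard one for translation table 11 (TAA, TAG, TGA). Only the first and last lines of the gene sequence are used here as excerpts.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# Excerpts copied from the first and last lines of the gene sequence listing.
first_line = "ATGTCCGCTG AACACGTACT GACGATGCTG AACGAGCACG AAGTGAAGTT TGTTGATTTG"
last_line = "GTAGAGTTTG AGCTGTACTA CAGCGTCTAA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(head.startswith("ATG"))  # CDS begins with the ATG start codon
print(ends_with_stop(tail))    # CDS ends with a stop codon
```

Applying `join_blocks` to the full gene listing and passing the result to `gc_fraction` would give a value to compare against the GC content field in the record.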