Gene Avin_02120 details

Gene Information

Locus tag: Avin_02120
Symbol: trpB
ID: 7759173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 205981
End bp: 207201
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 66%
IMG OID: 643803137
Product: tryptophan synthase subunit beta
Protein accession: YP_002797448
Protein GI: 226942375
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0133] Tryptophan synthase beta chain
TIGRFAM ID: [TIGR00263] tryptophan synthase, beta subunit
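
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon. Both are assumptions, although they are consistent with the values recorded here. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 205981
end_bp = 207201

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1221 bp
print(protein_length, "aa")   # 406 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.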


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAGA CTTCCTACCG CACCGGCCCC GACGAAAAAG GCCTGTTCGG CCGTTTCGGC 
GGCCAGTACG TCGCCGAAAC CCTGATGCCG CTGATCCTCG ACCTGGCCGA GGAATACGAG
AGGGCCAAGG TTGATCCGGC CTTCCTCGAA GAACTGGCCT ACTTCCAGCG CGACTACGTC
GGCCGGCCGA GCCCGCTGTA TTTCGCCGAG CGCCTGACCG AGCACTGCGG CGGCGCGAAG
ATCTACCTCA AGCGCGAAGA GCTGAACCAC ACCGGCGCGC ACAAGATCAA CAACTGCATC
GGGCAAATCC TGCTGGCCCG GCGCATGGGC AAGCAGCGCA TCATCGCCGA GACCGGCGCC
GGCATGCACG GCGTGGCCAC CGCCACCGTG GCCGCGCGCT TCGGCCTGCA GTGCGTGATC
TACATGGGCA CCACCGACAT CGATCGCCAG CAGGCCAACG TCTTCCGCAT GAAGCTTCTT
GGCGCCGAGG TGATCCCGGT CACCGCCGGC ACCGGCACCC TCAAGGACGC CATGAACGAG
GCCCTGCGCG ACTGGGTGAC CAACGTCGAG ACCACCTTCT ACCTGATCGG CACCGTGGCC
GGCCCGCATC CGTACCCGGC GATGGTCCGC GATTTCCAGG CGGTGATCGG CAAGGAAACC
CGCGAGCAAC TGATCGAGAA GGAAGGGCGC CTGCCCGACT CGCTGGTCGC CTGCATCGGC
GGCGGCTCCA ACGCCATGGG CCTGTTCCAC CCCTTCCTCG ACGAGCCGGG CGTGAAGATC
GTCGGCGTCG AGGCCGCCGG CCACGGCATC GAGACCGGCA AGCACGCGGC CAGCCTGAAC
GGCGGCGTGC CCGGCGTGCT GCACGGCAAC CGCACCTTCC TGCTGCAGGA CGCCGACGGC
CAGATCATCG ACGCCCACTC GATCTCCGCC GGCCTCGACT ACCCCGGCAT CGGCCCGGAA
CATGCTTGGC TACACGACAT CGGCCGCGTC GAGTACAGCT CGATCACCGA CCATGAAGCG
CTGCAGGCCT TCCATACCTG CTGTCGCCTG GAGGGCATCA TCCCGGCGCT GGAGTCGTCC
CATGCCCTGG CCGAAGTGTT CAAGCGCGCG CCCCGGCTTC CGAAAGACCA CCTGATGGTG
GTCAACCTCT CCGGCCGCGG CGACAAGGAC ATGCAGACCG TGATGCATCA CATGCAGGAA
AAACTGGAGA AGCACGCATG A
 
Protein sequence
MTETSYRTGP DEKGLFGRFG GQYVAETLMP LILDLAEEYE RAKVDPAFLE ELAYFQRDYV 
GRPSPLYFAE RLTEHCGGAK IYLKREELNH TGAHKINNCI GQILLARRMG KQRIIAETGA
GMHGVATATV AARFGLQCVI YMGTTDIDRQ QANVFRMKLL GAEVIPVTAG TGTLKDAMNE
ALRDWVTNVE TTFYLIGTVA GPHPYPAMVR DFQAVIGKET REQLIEKEGR LPDSLVACIG
GGSNAMGLFH PFLDEPGVKI VGVEAAGHGI ETGKHAASLN GGVPGVLHGN RTFLLQDADG
QIIDAHSISA GLDYPGIGPE HAWLHDIGRV EYSSITDHEA LQAFHTCCRL EGIIPALESS
HALAEVFKRA PRLPKDHLMV VNLSGRGDKD MQTVMHHMQE KLEKHA
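
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses the standard codon assignments, which NCBI translation table 11 shares (table 11 differs only in which codons may serve as start codons). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers written for this example, not part of any database tool, and the whitespace handling assumes the 10-base grouping shown in this record.

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGACCGAGA CTTCCTACCG CACCGGCCCC GACGAAAAAG GCCTGTTCGG CCGTTTCGGC"
print(translate(first_row))  # MTETSYRTGPDEKGLFGRFG
```

The printed residues match the first two blocks of the protein sequence (MTETSYRTGP DEKGLFGRFG). Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.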