Gene Ava_4408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4408 
SymboltrpD 
ID3680535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5526446 
End bp5527534 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content46% 
IMG OID637719761 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_324901 
Protein GI75910605 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.289652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACTT CCCCAATCCC TACCCAAGAA TCTTCTACTA GTTGGTATCT TCTACTGCAA 
CAATTAATAG ATGGTGAATC TTTAAGTCGA TCGCAAGCTG CTGAATTGAT GCAAGGTTGG
CTTAGTGAAG CCGTACCTCC AGAGTTATCA GGAGCAATCT TAACAGCACT CAACTTTAAA
GGCGTTTCTG CCGATGAGTT GACTGGTATG GCTGAAGTAC TACAATCTCA ATCTAAATTG
GGGAGTGGAG AAAATTCTTC CCAATTACCC ATTACCAATT ACCAATTCCC CATAATCGAT
ACTTGTGGCA CTGGTGGCGA CGGGTCATCA ACTTTTAACA TTTCTACTGC TGTGGCGTTT
GTGGCGGCTG CTTATGGTGT ACCTGTTGCC AAGCATGGTA ATCGTTCGGC TTCGAGTTTG
ACGGGTAGTG CCGATGTTTT AGAAGCTCTG GGTGTTAACT TGGGTGCTTC TAGTGAAAAA
GTACAAGCTG CTCTGCAAGA AGTCGGGATC ACATTTTTGT TTGCTCCCGG TTGGCATCCT
GCATTAAAAG CGGTGGCTAC TTTGCGACGG ACTTTAAGAA TCCGCACGGT GTTTAATTTG
CTGGGGCCGT TGGTCAATCC TTTGCGTCCC ACAGGACAAG TGGTGGGGTT ATTTACTCCC
AAACTTTTGA CAACTGTTGC CCAAGCTTTA GATAATTTGG GTAAGCAAAA GGCGATCGTC
TTACATGGAC GAGAAAGGCT GGATGAGGCT GGGTTGGGTG ATTTAACTGA CTTAGCAGTA
TTATCTGATG GTAAGCTACA GTTAACTACG ATAAATCCCC AGGAAGTGGG TGTGACACCT
GCTCCTATTG GCGCACTCCG GGGTGGGGAT GTACAAGAAA ATGCGGAGAT TCTCAAAGCT
GTATTGCAAG GCCAAGGAAC CCAAGCACAA CAGGACGCTG TAGCTTTAAA CGCGGCTTTG
GCGCTACAGG TGGCGGGTGC AGTCCCATTA TTAGACCATG CCAAAGGTGT GAGTGTAGCT
AAGGAGATCC TACAGACTGG TACTGCTTGG GCAAAATTGG AACAATTGGT ACACTTTCTG
AAGAGTTAG
 
Protein sequence
MTTSPIPTQE SSTSWYLLLQ QLIDGESLSR SQAAELMQGW LSEAVPPELS GAILTALNFK 
GVSADELTGM AEVLQSQSKL GSGENSSQLP ITNYQFPIID TCGTGGDGSS TFNISTAVAF
VAAAYGVPVA KHGNRSASSL TGSADVLEAL GVNLGASSEK VQAALQEVGI TFLFAPGWHP
ALKAVATLRR TLRIRTVFNL LGPLVNPLRP TGQVVGLFTP KLLTTVAQAL DNLGKQKAIV
LHGRERLDEA GLGDLTDLAV LSDGKLQLTT INPQEVGVTP APIGALRGGD VQENAEILKA
VLQGQGTQAQ QDAVALNAAL ALQVAGAVPL LDHAKGVSVA KEILQTGTAW AKLEQLVHFL
KS