Gene Ava_4721 details


Gene Information

Locus tag: Ava_4721
Symbol:
ID: 3679701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5902681
End bp: 5904195
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 47%
IMG OID: 637720077
Product: anthranilate synthase, component I
Protein accession: YP_325213
Protein GI: 75910917
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
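
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record states neither explicitly. All variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 5902681
end_bp = 5904195
gene_length_bp = 1515
protein_length_aa = 504

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, so the number of
# amino acids is the number of codons minus one.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein == protein_length_aa)
```

Under these assumptions all three checks agree with the record.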


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTTCC CCAATTTCTC CGAATTTTCT CAGCTAGCTT TGCAAGGTAA TTTTGTGCCG 
GTGTATCAGG AATGGGTGGC AGACCTAGAT ACACCAGTTT CCGCTTGGTA CAAAGTTTGC
GCTGGAGAGC CTTATAGCTT TTTGCTGGAA TCGGTAGAAG GTGGGGAAAA GATTGGGCGT
TATAGTCTGC TGGGGTGTGA TCCTTTATGG ATTTTGGAAG CTAGGGGTGA GACGACCACT
CAGACAAACC GTGATGCTTC TCAGGTTGTG TTTACAGGTG ATCCATTTAC AGCTTTAGCC
GATTGTCTAG CACCATATAA ACCAGTCAAG TTACCCCAAC TACCACCAGG AATTGGCGGT
TTGTTCGGTT TTTGGGGTTA TGAATTAATT CGTTGGATTG AACCGCGTGT ACCAGTGCAT
TCCCAAGATG AGCGCAACAT TCCTGATGGA TTGTGGATGC AGGTAGACCA TTTGTTAATA
TTTGACCAGG TAAAGCGGAA AATATGGGCG ATCGCCTATG CTGATTTACG TGATCCAAAT
GTAGATTTAA AAGCAGCGTA TCAGCAAGCG TGCGATCGCG TCACCCAAAT GGTGAGCAAA
CTATCTTTGC CCGTATCGCC ACAAAAAACC ATATTAGAAT GGACACCCCC AGGCAGCCAA
AACGTGGGGG AAACACAAGA ATATACCAGC AACTTCACTC GACCGGAATT TTGCGCCAGC
GTCCAAAAAG CCAAAGAGTA CATCAAAGCG GGTGATATCT TTCAAGTAGT CATTTCTCAA
CGCCTTTCCA CCCAATACAC CGGCGATCCC TTCTCTCTGT ACCGTTCCCT GCGTCAGATC
AATCCCTCGC CCTACATGGC GTACTTTAAC TTCCAAGACT GGCAAATCAT CGGTTCCAGC
CCCGAAGTGA TGGTAAAAGC GGAACTGGAT GGAGATGGTG AAGCAGTAGC CACAGTTCGC
CCCATTGCGG GGACTCGTCC ACGGGGTAAG ACAACCAAAG AAGATGCAGA ATTAGCCGCA
GATTTACTCC AAGACCCCAA AGAAATCGCC GAACACGTCA TGCTGGTGGA TTTAGGACGC
AACGATTTAG GGCGTGTTTG TGCCAGTGGG ACTGTAAAAG TTGATGAATT AATGGTGGTG
GAACGTTACT CTCATGTAAT GCACATTGTG AGTAATGTTG TAGGGAAATT AGCCGCAAAT
AAAAACGCCT GGGATTTATT AAAAGCCTGT TTTCCCGCCG GCACAGTTAG CGGCGCACCA
AAAATTAGGG CGATGGAAAT CATTAATGAG TTAGAACCCA GTCGGCGGGG TGTGTATTCT
GGGGTGTATG GCTATTACGA TTTTGAAGGG CAATTAAATA GTGCGATCGC TATTCGGACA
ATGGTAGTCA GAGATCACAC TGTAACGGTA CAAGCTGGTG CAGGTTTGGT GGCTGATTCT
GACCCAGAAA AAGAATATGA GGAAACTTTA AATAAAGCTA GGGGACTTCT CTTGGCAATT
CGCTGCTTAC GCTAG
 
Protein sequence
MIFPNFSEFS QLALQGNFVP VYQEWVADLD TPVSAWYKVC AGEPYSFLLE SVEGGEKIGR 
YSLLGCDPLW ILEARGETTT QTNRDASQVV FTGDPFTALA DCLAPYKPVK LPQLPPGIGG
LFGFWGYELI RWIEPRVPVH SQDERNIPDG LWMQVDHLLI FDQVKRKIWA IAYADLRDPN
VDLKAAYQQA CDRVTQMVSK LSLPVSPQKT ILEWTPPGSQ NVGETQEYTS NFTRPEFCAS
VQKAKEYIKA GDIFQVVISQ RLSTQYTGDP FSLYRSLRQI NPSPYMAYFN FQDWQIIGSS
PEVMVKAELD GDGEAVATVR PIAGTRPRGK TTKEDAELAA DLLQDPKEIA EHVMLVDLGR
NDLGRVCASG TVKVDELMVV ERYSHVMHIV SNVVGKLAAN KNAWDLLKAC FPAGTVSGAP
KIRAMEIINE LEPSRRGVYS GVYGYYDFEG QLNSAIAIRT MVVRDHTVTV QAGAGLVADS
DPEKEYEETL NKARGLLLAI RCLR
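
The sequences above are printed in blocks of ten characters separated by spaces. The sketch below, in Python, shows one way such blocks could be joined into a continuous string and how a GC fraction could be computed from it. The helper names `clean` and `gc_fraction` are hypothetical and not part of any database tooling. Only the first and last lines of the gene sequence are pasted in, so the code checks the start and stop codons rather than reproducing the whole-gene GC content.

```python
def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a (possibly blocked) sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATTTTCC CCAATTTCTC CGAATTTTCT CAGCTAGCTT TGCAAGGTAA TTTTGTGCCG"
last_line = "CGCTGCTTAC GCTAG"

first_codon = clean(first_line)[:3]   # expected to be a start codon
last_codon = clean(last_line)[-3:]    # expected to be a stop codon

print("first codon:", first_codon)
print("last codon:", last_codon)
print("bases on first line:", len(clean(first_line)))
```

Passing the full gene sequence to `gc_fraction` would give a value that can be compared with the GC content field in the record.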