Gene Aazo_3791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3791 
Symbol 
ID9341596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3851123 
End bp3852388 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content39% 
IMG OID 
Product6-pyruvoyl tetrahydropterin synthase and hypothetical protein 
Protein accessionYP_003722447 
Protein GI298492270 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTGTG TAGTCAATCG TCGCGCTCAG TTTTTAGCTA GTCATCATTA TTCGCTACCA 
GAACTAGGCG AAACCGAGAA CTTGGAAAAA TTTGGTCGTT CCTCAAAATT TCCCGGGCAA
GGCCACAATT ATACCCTATT CATCTCTATA GCTGGGGAAT TAGATAAATA TGGTATGGTG
CTAAATTTAT CTGATGTCAA ACAAGTAATT AAGCGGGAAA TAACCGATCA ATTAGATTTT
TCTTACCTTA ATAATGTGTG GCCAGAATTT CAACAAACTT TGTCCAGTAA TGAAAATATT
GCACGAGTGA TCTGGCAGCG TTTAGCCCCT CATTTGCCTC TAATTCGCGT CCAGTTGGTT
GAACATACTG GGCTTTGGGC AGATTATATG GGAGAAGGAA TGCAAGCTTC TCTCAGCATC
AGCACCCACT TTAGCGCCGC CCATCGTTTG GCTTCTAACC TCAGTTCTGA AAAGTATAGT
AAATGTAGCC GTACACATGG ACACAACTAC CATTTAGAAG TGACTGTAGA AGGGGAAATG
GACTCACGAA CAGGGATGAT CATTGATTTA GATGCCCTAA ATAGAGTTGT TGAAAATGAT
GTAGTCAAAA TCTTTGATCA CTTCTGTGTA AATAAAGATA TTCCTCATTT TTCCGAAATT
GTCCCGACTA CCGAAAATCT TGTACTTTAC ATTAGCAACC TACTCAAATC ACCTATTCAG
AAATTAGGGG CAAAACTGTC TCAAGTTAAG CTGTTTGAAA GTCCTCAACT CTGGGTAGAT
TATCAGGGTA ATGGAACAGA AACGTTCCTC ACCGTGAAGA GTGAATTTAG TTCTGCACAC
AGATTAGCTC ATCCTGGTTT GAGTTTAGAA AAAAATACAG AGATTTACGG TAAATGCGCC
CGTGTGAATG GACATGGACA TAACTATCAA TTAGAAGTGA CAGTGAAGGG TGAAATCGAC
TCCAGCACAG GTATGGTGGT TGATTTAGGT GCTTTAAATC AGGTAATTGC TAATTTAACT
GAACCCCTTG ATCACAGTTT CTTAAATAGA GATGTTCCCT ATTTTGGGGA AGTTGTACCA
ACAGCAGAAA ATATTGCTCT TTATATTAGT AATATGTTGC GCTTGCCTAT TCAAGAACTA
GGAGCAGAAC TTTACAAAGT TAAACTAGTT GAAAGTCCTA ACAACTCCTG CGAAATCTAC
CCATCTGACA TAGAATCAAC ATCTGTGATC ACAGTACAGA ATCAGCCTGT TTTAGCGACA
GTTTAA
 
Protein sequence
MQCVVNRRAQ FLASHHYSLP ELGETENLEK FGRSSKFPGQ GHNYTLFISI AGELDKYGMV 
LNLSDVKQVI KREITDQLDF SYLNNVWPEF QQTLSSNENI ARVIWQRLAP HLPLIRVQLV
EHTGLWADYM GEGMQASLSI STHFSAAHRL ASNLSSEKYS KCSRTHGHNY HLEVTVEGEM
DSRTGMIIDL DALNRVVEND VVKIFDHFCV NKDIPHFSEI VPTTENLVLY ISNLLKSPIQ
KLGAKLSQVK LFESPQLWVD YQGNGTETFL TVKSEFSSAH RLAHPGLSLE KNTEIYGKCA
RVNGHGHNYQ LEVTVKGEID SSTGMVVDLG ALNQVIANLT EPLDHSFLNR DVPYFGEVVP
TAENIALYIS NMLRLPIQEL GAELYKVKLV ESPNNSCEIY PSDIESTSVI TVQNQPVLAT
V