Gene Aazo_1939 details


Gene Information

Locus tag: Aazo_1939
Symbol:
ID: 9339732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2021660
End bp: 2023108
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 46%
IMG OID:
Product: ATP synthase F1 subunit beta
Protein accession: YP_003721151
Protein GI: 298490974
COG category:
COG ID:
TIGRFAM ID:
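
The start, end, gene length and protein length fields in this record are mutually consistent, and the arithmetic can be checked in a few lines of Python. This is a minimal sketch: the function and variable names are illustrative and not part of the record, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 2021660
END_BP = 2023108


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1449 482"
```

Both results agree with the Gene Length (1449 bp) and Protein Length (482 aa) fields.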


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCACCA CCGCAGAAAA AACAAACATA GGTTACATTA CCCAAATCAT TGGTCCAGTT 
GTAGACGTTA AGTTCCCCGG CGGTAAATTA CCCCAAATCT ACAACGCTTT GACCATCACA
GGCACTAACG AAGCTGGACA AAACATCAGC TTGACCGTTG AAGTACAGCA ATTGCTAGGC
GACAACCAAG TTAGAGCCGT GGCGATGAGT AGCACTGACG GATTAGTACG TGGTTTGGAA
GCCGTTGATA CTGGCGCTCC CATCACCGTA CCCGTAGGTA AAGCTACTCT GGGTAGAATT
TTCAACGTTT TGGGCAATCC TGTAGATAAC CAAGGACCTG TAAACGCTGA AGCCAGACTA
CCCATCCACC GTGATGCTCC TAAATTCACA GAATTGGAAA CCAAACCTTC TGTGTTTGAA
ACTGGGATTA AAGTTGTTGA CTTGCTGACT CCCTACCGAC GCGGCGGTAA AATTGGTCTG
TTCGGGGGTG CTGGTGTTGG TAAAACCGTG ATCATGATGG AATTGATCAA CAACATTGCT
ACCCAACATG GTGGTGTGTC TGTATTCGCA GGTGTGGGTG AACGTACTCG TGAAGGTAAT
GACCTCTACA ATGAAATGAT TGAATCTGGG GTTATCAACA AAGACAACCT CAATGAATCT
AAGATTGCTC TAGTTTACGG TCAAATGAAC GAACCACCCG GAGCTAGAAT GCGGGTTGGT
TTGTCTGGTT TGACAATGGC TGAGTATTTC CGTGATGTGA ACAAGCAAGA CGTATTGCTG
TTTGTTGACA ATATTTTCCG GTTTGTACAA GCTGGTTCTG AAGTATCCGC ACTATTGGGA
CGGATGCCTT CTGCGGTAGG ATATCAGCCT ACTCTGGGTA CAGACGTTGG TGCATTGCAA
GAACGGATTA CTTCTACCAC CGAAGGTTCT ATTACTTCTA TTCAAGCTGT ATATGTACCT
GCGGATGACT TGACTGACCC CGCACCTGCA ACTACCTTTG CTCACTTGGA TGGTACAACA
GTATTGTCTC GTGGTTTGGC AGCTAAGGGT ATCTATCCAG CGGTTGATCC TCTGGGTTCT
ACTTCCACCA TGCTACAGCC AAACATTGTT GGTGATGAAC ACTACAACAC TGCTCGCAGT
GTACAATCAA CTCTACAACG TTACAAAGAA CTACAAGACA TCATCGCTAT TCTGGGTTTA
GATGAATTGT CTGAAGAAGA CCGTCTGATT GTAGCACGGG CGCGGAAAGT TGAGCGTTTC
TTGTCTCAGC CTTTCTTCGT AGCTGAAGTA TTTACTGGTT CTCCTGGTAA GTATGTGAAG
TTGGAAGACA CCATCAAAGG TTTCCAGAAG ATTCTCTCCG GTGAGTTGGA TGCTTTACCA
GAGCAGGCTT TCTACTTGGT AGGCGATATT AACGAAGCAA TCGCAAAAGC TGAAAAGCTC
AAAGGTTAA
 
Protein sequence
MVTTAEKTNI GYITQIIGPV VDVKFPGGKL PQIYNALTIT GTNEAGQNIS LTVEVQQLLG 
DNQVRAVAMS STDGLVRGLE AVDTGAPITV PVGKATLGRI FNVLGNPVDN QGPVNAEARL
PIHRDAPKFT ELETKPSVFE TGIKVVDLLT PYRRGGKIGL FGGAGVGKTV IMMELINNIA
TQHGGVSVFA GVGERTREGN DLYNEMIESG VINKDNLNES KIALVYGQMN EPPGARMRVG
LSGLTMAEYF RDVNKQDVLL FVDNIFRFVQ AGSEVSALLG RMPSAVGYQP TLGTDVGALQ
ERITSTTEGS ITSIQAVYVP ADDLTDPAPA TTFAHLDGTT VLSRGLAAKG IYPAVDPLGS
TSTMLQPNIV GDEHYNTARS VQSTLQRYKE LQDIIAILGL DELSEEDRLI VARARKVERF
LSQPFFVAEV FTGSPGKYVK LEDTIKGFQK ILSGELDALP EQAFYLVGDI NEAIAKAEKL
KG
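
The protein sequence is the translation of the gene sequence under translation table 11, and the GC content field is the fraction of G and C bases. The sketch below shows one way to reproduce both from the space-grouped text in this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tooling. Table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon table is used here.

```python
from itertools import product

# Amino acids in NCBI order for codons built from bases in T, C, A, G order.
# Table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence, copied from this record.
first_blocks = "ATGGTCACCA CCGCAGAAAA AACAAACATA"
print(translate(first_blocks))  # prints "MVTTAEKTNI"
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.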