Gene Aazo_0189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0189 
Symbol 
ID9337974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp179738 
End bp181084 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content42% 
IMG OID 
Productpentapeptide repeat-containing protein 
Protein accessionYP_003719938 
Protein GI298489761 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.130578 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTG CATCTAATTC TTCTCATCCA CCAAAGCCAG AAACGAATAT AGATAAAGAC 
TTGCAGCCAG ATGATTTTGA TGCAAGTGAA AATGGACTAA CTTCAGAAGG TTTGGCCGCA
CAACAAGCCT TATCAGCAAT TTCTTCTTTA CAATCTCCCC AACATACAAA TGCTCTCAAA
CAAGCGACAT CTGGTTTTAA AGACATTTCT CATCATCAAC TTGCTGTCAA GCCTAGAGCA
TTATTATTTA CTCTACTAGC GATCGCACTC ACTTTTATTG GTATTGCTAT TAATAACTCC
TTTCTGGGCA TTTTAGGAAC TCTCACAACT TTGGTGTTAT CTGTGGCTAT ACTTTTACCT
TGGTTGCAAG ACGTCGTTCA AGAATGGTTT TCTGCCCAAG AAAGAACAGT TTTGGTGGGT
TTGACGGGCT TATTAGTAGC AATTTGTGGC TTATTCAGGT TTACTGGTGT CGAAAATGGA
CTACTCCGCT GGGGAAGCAA GATTAACTGG GATATTGCGG GTACTTTAGC AGATTGGTTT
GGCGCTTTAG GGCAAATTTC CATAGCTATC ATCGCTGTTT ACGTAGCTTG GCGACAATAT
GTAATTTCTA AAGACTTAAC TATTCAACAA AACCTGCTGA CAGTACAACA AAATATTATT
ACCCAACAGC AAACAATAGA TTCTTATTTC CAAGGTGTTT CTGACTTGGT ACTGGACGAA
GAGGGATTAT TAGAAGACTG GCCACAAGAA AGAGCGATCG CAGAAGGACG AACTGCCGCA
ATTTTTAGTA GTGTAGATGG TAGTGGTAAA GCCAAAATTC TCCGTTTTCT CTCCCGTTCA
AAATTACTCA CACCATTAAA ACGCGATCGT CGTTTAGGTA GAGCGATTCT TGACGGTATC
GGTGGCTACG CAGAAGACCT TTTAGAAGGT GTGCGCGTCA TTGACTTAGG TGTAATGTTA
GCAGGTGCAG ACCTGTCGAA CACTGATTTA CGCTGGACTG ATTTAAGCGA AGCGAATCTT
GTCCGTGCTA ATCTCAGCGG TTGTGATTTA GTCAAAGCCA ACCTATCCCG CACTATCTTA
TATGATGCGG ATCTCAACAA TAGCGATTTA AATGGAGTGC GTTTCTTTTA TGGTTCATTA
GAAAAAGCCT CACCCCGCAG TCGCAACAAC CCACCCAACT ATGAAACAGG GGAACACACC
GGCGCAGTTG TGGAAAATGC CGATTTCAGA AACGCACAAC GGATGTCCGA ATCAACCCGT
CAATACTGCT GTGCTTGGTG TGGAGAAGAA GCCAGACGGA CTATTCCTGG TGGTTGTGAA
GGTATTCCCA ATAAATTGGG TAGATAA
 
Protein sequence
MTIASNSSHP PKPETNIDKD LQPDDFDASE NGLTSEGLAA QQALSAISSL QSPQHTNALK 
QATSGFKDIS HHQLAVKPRA LLFTLLAIAL TFIGIAINNS FLGILGTLTT LVLSVAILLP
WLQDVVQEWF SAQERTVLVG LTGLLVAICG LFRFTGVENG LLRWGSKINW DIAGTLADWF
GALGQISIAI IAVYVAWRQY VISKDLTIQQ NLLTVQQNII TQQQTIDSYF QGVSDLVLDE
EGLLEDWPQE RAIAEGRTAA IFSSVDGSGK AKILRFLSRS KLLTPLKRDR RLGRAILDGI
GGYAEDLLEG VRVIDLGVML AGADLSNTDL RWTDLSEANL VRANLSGCDL VKANLSRTIL
YDADLNNSDL NGVRFFYGSL EKASPRSRNN PPNYETGEHT GAVVENADFR NAQRMSESTR
QYCCAWCGEE ARRTIPGGCE GIPNKLGR