Gene Aazo_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1100 
Symbol 
ID9338896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1184324 
End bp1185805 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content41% 
IMG OID 
Productpentapeptide repeat-containing protein 
Protein accessionYP_003720573 
Protein GI298490396 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.257041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCTA AAAAGACAGA TGCAACATTA CATTGGTTGC TCACCATCAT CCTTATTGTT 
GCTCTGTCCT TAATACTGAT TGTCTTCGCA TCTACTAATA TTGAGAAGTT ATCACTTCAG
CAGCGAATAG CTAACAAAAA CCAAGCATTA ACCACCACTG CTATAGTCTT TCTGGGTTTA
GCTGTTGCAC TTAATGCTTA TTACGCAGCC AAGCGGAATC AAGTCATGGA AAGAAATGCT
ATTACGGCGG AGAAAACTCT GGCAATTGGC ATTGAAAATA CTAAACTCAC CCAAGACAAA
CTCATTGCAG AACGATTTAT TGGCTCTATT GCTCAGTTAG GACATGAAAA GGTAGAAACG
CGCATAGGTG CTATTTATGC CTTGGAAAGA ATTGCCCAGG ATTTCCCTCA AGAACACTGG
ACAATTATGG AAATTCTCAC TGCTTTTGTT CGTGAAAATG CACCTGTACA ACCGGAGCGA
AAGCCACAAA AACCAGAAGA TATCATGGCG ATTGATTTCG GGAAAAATCG TGACAGAGTG
CGTCGTCAAC AATCAGTAGA TTATTCTCTC TCATGGGAAT CTTTTAAACT TCGTACTGAT
ATTCAAGCTG CTTTGACTGT CATCGGTAGA CGCAATTTTC AACAAGACCG AGAAAATCAA
AAACTGGATT TACGCAATAC TGACATCAGA CGAGTAGACT TAGCAGGAGG TAAACTACAA
AGAGTGGATT TGCGCGGATC TGATTTGTGT GGTGCAGACT TGCGGGAAGT TGATTTAAGT
GAAGCAGACT TGGATGGTGC AAAACTTATT GGTTCGATTC TTTATGAAGC CAACTTATTT
AAAGCGAGTT TACGGGGAGT TAATTTGAAT CGGGCAAATC TGAATCTCGC TAATTTATAT
GGAGTAAACC TACGTTCAGC TAATTTGTGT GGTGCAAGTT TGCGTTCAGC TAATTTACAA
GCTGCTAACT TGTATAAAGC CAATTTGCAA CAAGCAACTC TCAAAGCTGC TAATTTGTCT
GGTGCTAAGT TATTTTTAGC TAACTTGCAA GGGGCGAAAT TGGGTAAAAC TAATTTAAGT
TCAGCCGGCT TGACTGCTGC GAATCTGGAA GGTGCAAATC TCAATGGTGC CAATCTGCAA
GGTGCAAATT TAAACGCTGC AAAATTACAG CAAACGGATA TCTATTTTGC TAATCTCAGT
GAGGCTAGTT TGACAGAAGC AGATCTACAT AATGCTAATT TGATGGGAGC AAATCTTTCT
CTAGCAACGC TTGATGAAGC TGATCTGTCC TGGGCTAATT TGATGGGGGC TAACTTATCA
GGCGCTCATC TTTGTGATGT TAAACTGACT GGAGCGATTT TAACTGGGGC GAAAAACCTG
GAATCTGAGC AGATAGTTAT GGCGTTAGGC GATTGGACTA CTCGTCTGCC TGATTATATC
GATTATATCG AAGCGCCAGC CAGTTGGCTA CAATCTGTTT AA
 
Protein sequence
MSAKKTDATL HWLLTIILIV ALSLILIVFA STNIEKLSLQ QRIANKNQAL TTTAIVFLGL 
AVALNAYYAA KRNQVMERNA ITAEKTLAIG IENTKLTQDK LIAERFIGSI AQLGHEKVET
RIGAIYALER IAQDFPQEHW TIMEILTAFV RENAPVQPER KPQKPEDIMA IDFGKNRDRV
RRQQSVDYSL SWESFKLRTD IQAALTVIGR RNFQQDRENQ KLDLRNTDIR RVDLAGGKLQ
RVDLRGSDLC GADLREVDLS EADLDGAKLI GSILYEANLF KASLRGVNLN RANLNLANLY
GVNLRSANLC GASLRSANLQ AANLYKANLQ QATLKAANLS GAKLFLANLQ GAKLGKTNLS
SAGLTAANLE GANLNGANLQ GANLNAAKLQ QTDIYFANLS EASLTEADLH NANLMGANLS
LATLDEADLS WANLMGANLS GAHLCDVKLT GAILTGAKNL ESEQIVMALG DWTTRLPDYI
DYIEAPASWL QSV