Gene Aazo_3113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3113 
Symbol 
ID9340916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3201299 
End bp3203425 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content38% 
IMG OID 
ProductNHL repeat-containing protein 
Protein accessionYP_003721974 
Protein GI298491797 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTCA ATTTTTTACT CGGTCTAACT TGTGGATTAA TGGTGATATT TGTGGGTATA 
ACAATGGACT CAGCCACGAC AAAAACCACT ACAGTAGCTG GATTTACCTA TGAAACTTCT
TGGCTTGGTA ATACTATCGG CAAAGGTAAA TTACGAGTCC AAAATAATGT AGAGGCAATG
TATGTAACAA CTAATGGTAA AATTTATACT AATAGTTATT GGGATGAAGC TGGAGGAGAA
GCAGGAATAT ATAAAGATGG TAAGGTTATG GGTTTTCTAG AAGATATTCA TGGGTGGAAT
CGTCTCGGTG GTAAAGCAGT AACAGCCAAT AGCAAATATA TTTATATTGC TATGTCCCAA
AGTGGAATGA ATAATGGAAA AGACGGTTAT CCACCAGAAG GTAAAATTTG GTACTGTGTC
AGACGTTATA ATTTAGCCGG AAAATCTGTA CCTTTTCCGG GAGGTAATGG CTGGGATAAA
AGTATGTTAA TTATCAGTGA TAAACATGAA GTAACTGGAT TAGCAACTGT AGGTGATAAA
TTATTTATTA GTAATGCTGC TGGTAACATT GTACAGATTT ACAATACTGA AACAATGCAG
GAGATAGGCA AATTCGCTGT TACCAATCCT GGAGGAATAG CCATTGATCC ACAAGGAGAT
TTGTGGATTA TTCAAACTAA AAAAGGTACT ATGGTTGGTC AGATTGTCCA TTATTCGCAA
ACAGGAAAAA AACTGCCGCA ACAAATTGTA GATGTGGTTG ATCCAACAGC GATCGCAACT
TACAGACAAG ATAGGTTATT AGTAGCAGAA AATGGCATCC GTCAACAAGT GCTGATTTAC
AATATCGAAA ATCAGCCTGT ACAAGTAGGA ACATTAGGGA CTGAAAATGG TATCTATAGT
GGTGTTCCTG GTGAAGTTGA AAGTTTAAAA TTATATGGAA TTACGGGAGT CGGTACAGAC
AGCACAGGCA ATATCTATAT CAACAATAAT GGTTTTAATA AATCAGGTAC AGATTTAAGA
AAATTTTCCG AATCAGGGAA ACAACAATGG CAATTATTGG GTTTAACATT TGTAGATAAT
GCAGATACTG ACCCCAAAAG TGATGGTGTA GATGTATTCA CCAAGCATGA AAAATACCTG
ATAGATTATA GTAAACCATC TGGTAAACAA TGGACTTATA AAGTCTACAC ACTTAATCCT
TTTAAATATC CTCAAGATCC TCGTTTACAT ACATCCCCAG ATGGCCCTAT TTTCCGACGT
ATTCAAGGGA GGCCTTTTTT ATTCCTCACA GATATGTTTG GCAGCTTACT GCAGATTTAT
CGTTTTCAAC CAACTACAGA CGGTAATATT GCTATTCCTG CGGGGCTATT TGTGGTAACT
AATGATAAAG GTAAATCTAT TCAGGGTAAT TGGCCTCCCT ATCAACCAGC CAAAGGTGAA
TGGATTTGGC GAGATAAAAA TGGTAATGGT GCATTTGATC AAAATGAATA TGACAGCAGT
GAAGATTATC CCTACATAGG TGGTTGGTGG GTAGATACTA AAGGTGATGT TTGGAAAACT
TTGCGAACTG AAGATGGTAT TCGTCATTAT CCTTTACAGG GATTAGATAG CAAAGGAAAT
CCCATCTATA CTTATACTTC AATACAAAAA CACAAAACTC CTAGTGTGTT TAAAGATTTG
CGGCATATTG AGTATTTTCC TGAAACAGAT ACTATGTATT TATCAGGTTT CACAGCAGCA
CATCCCGCTT CTGGTGATGA TACCAGGGTT ATAGGATCAG AAATTGCCCG GTTTGATAAT
TGGAGTAAAG GAAATCGCAC TCCTAAATGG CGGACTGTAG TTCCTTATGA TTCTACTGGG
AAGCGTGAAG TTTCTACTGC GGCCATCAGT GTAGCTGGTG ATTATGTGTT TGCAGTGACG
GTGAAAACCG CAGAAGTTTA TGTCTATAAA ACTGCTACAG GCAAACTTGT ACTCAAGTTC
GGTCCTGGTC CAGAAGTTGG TGGAGAAAGT GGCTGGGTTG ATATCCCCCA CGGTATCCGG
GCTTTTCGCC GTGCTAATGG TGAATACTTA GTGTTTGCAG AGGAAAATAT GAACGGAAAG
GTGATTATTT ACCGTTTCTC AATGTGA
 
Protein sequence
MAFNFLLGLT CGLMVIFVGI TMDSATTKTT TVAGFTYETS WLGNTIGKGK LRVQNNVEAM 
YVTTNGKIYT NSYWDEAGGE AGIYKDGKVM GFLEDIHGWN RLGGKAVTAN SKYIYIAMSQ
SGMNNGKDGY PPEGKIWYCV RRYNLAGKSV PFPGGNGWDK SMLIISDKHE VTGLATVGDK
LFISNAAGNI VQIYNTETMQ EIGKFAVTNP GGIAIDPQGD LWIIQTKKGT MVGQIVHYSQ
TGKKLPQQIV DVVDPTAIAT YRQDRLLVAE NGIRQQVLIY NIENQPVQVG TLGTENGIYS
GVPGEVESLK LYGITGVGTD STGNIYINNN GFNKSGTDLR KFSESGKQQW QLLGLTFVDN
ADTDPKSDGV DVFTKHEKYL IDYSKPSGKQ WTYKVYTLNP FKYPQDPRLH TSPDGPIFRR
IQGRPFLFLT DMFGSLLQIY RFQPTTDGNI AIPAGLFVVT NDKGKSIQGN WPPYQPAKGE
WIWRDKNGNG AFDQNEYDSS EDYPYIGGWW VDTKGDVWKT LRTEDGIRHY PLQGLDSKGN
PIYTYTSIQK HKTPSVFKDL RHIEYFPETD TMYLSGFTAA HPASGDDTRV IGSEIARFDN
WSKGNRTPKW RTVVPYDSTG KREVSTAAIS VAGDYVFAVT VKTAEVYVYK TATGKLVLKF
GPGPEVGGES GWVDIPHGIR AFRRANGEYL VFAEENMNGK VIIYRFSM