Gene Aazo_1969 details

Gene Information

Locus tag: Aazo_1969
Symbol:
ID: 9339762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2050197
End bp: 2051321
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: twitching motility protein
Protein accession: YP_003721170
Protein GI: 298490993
COG category:
COG ID:
TIGRFAM ID:
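
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2050197, 2051321)
print(length, protein_length_aa(length))  # prints: 1125 374
```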


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAATGA TCATTGAAGA CTTGATGGAA CAGTTGATTG ACATGGGTGG TTCCGATCTG 
CACTTATCCG CAGGTTTACC TCCCTACTTC CGCGTGAGTG GTCATCTGAC TCCTATTGGT
GATCATGTGC TGACTGCAGA TGAATGCCAA AGATTAATTT TTAGTATGCT CAATAACACC
CAGCGAAAAA CCTTAGAACA AACCTGGGAA TTAGATTGTT CCTATGGTGT CAAAGGATTA
GCTCGTTTTC GGGTGAATGT CTACAAAGAA CGTGGTGCTT ATGCTGCTTG TTTGCGAGCA
TTGAGTTCTA AAATTCCTAA CTTCGAAAAA TTAGGTTTAC CAAATATAGT TCGGGAAATG
TCTGAAAAAC CCAGAGGATT AATTCTGGTG ACAGGCCCAA CAGGTTCAGG TAAAACCACT
ACTCTAGCAG CAATGATTGA CCTCATTAAC CGCACTAGAG CAGAACATAT TTTAACCGTT
GAAGACCCTG TAGAATTTAT TTATGAACCA ATTAAAAGCC TAGTTCACCA ACGTCAACTA
GGGGAAGACA CCAAGAGTTT TGCTAATGCC TTGAAAGCAG CTTTGCGGGA AGATCCAGAT
ATCATTCTGG TAGGAGAAAT GCGGGATTTG GACACAATTG CTTTGGCAAT TTCTGCCGCA
GAAACAGGTC ACTTAGTCTT TGGTACAATG CACACCAGTT CTGCTGCCCA AACAGTTGAC
CGGATCATCG ACGTTTTCCC GCATGAAAAA CAAACCCAAG TACGGGTACA ATTATCAAAT
TCTTTAGTAG CAGTATTTAG CCAAACTTTA GTATCTAAGA AAAATCCCAA ACCAGGTGAA
TATGGTCGGG TGATGGCTCA AGAAATTATG GTAATCACTC CTGCTATTTC CAACTTAATT
CGTGAAGGTA AAACAGCGCA AATTTATTCA GCCATTCAAA CTGGTGGTAA ATTGGGAATG
CAAACTTTAG AGAAGGTTTT AGCTGATTTA TATAAAGCTG GAACTATCTC TTTTGAAGCG
ACTATGTCTA AGACTTCTAA GCCTGATGAA GTTCAACGTC TCATTGGGTC AGCACCACCA
CAAGCATCAG CAGCAAAATC TGGTACTGCT GCTAAAGCTC ATTAG
 
Protein sequence
MEMIIEDLME QLIDMGGSDL HLSAGLPPYF RVSGHLTPIG DHVLTADECQ RLIFSMLNNT 
QRKTLEQTWE LDCSYGVKGL ARFRVNVYKE RGAYAACLRA LSSKIPNFEK LGLPNIVREM
SEKPRGLILV TGPTGSGKTT TLAAMIDLIN RTRAEHILTV EDPVEFIYEP IKSLVHQRQL
GEDTKSFANA LKAALREDPD IILVGEMRDL DTIALAISAA ETGHLVFGTM HTSSAAQTVD
RIIDVFPHEK QTQVRVQLSN SLVAVFSQTL VSKKNPKPGE YGRVMAQEIM VITPAISNLI
REGKTAQIYS AIQTGGKLGM QTLEKVLADL YKAGTISFEA TMSKTSKPDE VQRLIGSAPP
QASAAKSGTA AKAH
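
The relationship between the gene sequence and the protein sequence can be reproduced with a small script. The following is a minimal sketch, not the pipeline used to produce this record. It assumes the sequence is pasted as plain text with spaces and line breaks between blocks, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). `gc_percent` and `translate` are hypothetical helper names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a CDS codon by codon; unknown codons become 'X'."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence above.
print(translate("ATGGAAATGA TCATTGAAGA C"))  # prints: MEMIIED
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the protein sequence and the GC content reported in the record.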