Gene Aazo_4268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4268 
Symbol 
ID9342072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4339769 
End bp4340845 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content41% 
IMG OID 
ProductPilT protein domain-containing protein 
Protein accessionYP_003722765 
Protein GI298492588 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.040207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGATA TCATCATCAT TCTCTCGTTT ATTTTGGCAG CATCGGGAAT AGGTTACTTT 
AGTACCGATC TACTCCCCCC TGGAACACTA GACGGGGTAA CAAACCTAAA CGCATTGCGG
TTAATTGTTG CTGTATTTGC CGCTATTATT GGTGGTGCAA TTGGGCTAAG TTTCCAGACC
ACATACCGTC GCTTAGAAAC ACAAGTCCGA GAAATGCCCC TCGAAGTCAT TTTAACTCGT
GCCATTGGCT TAGTCATTGG GCTCTTACTA GCCAATTTAA TGTTAGCCCC GCTATTTTTA
CTACCCATTC CCCCAGATTT TGGATTTATT AAACCTCTAG TAGCAGTTGT CGGCAGTATT
ATACTTGCTG TCACAGGCAT GAATTTGGCA GATACTCATG GACGCGGTTT ATTACGCTTA
ATTAATCCCA ACACTGTAGA AACAATGGTA GTTGAAGGAA CTCTTAAACC TGCAAACACC
AAAGTTTTAG ATACCAGTTG CATTATTGAT GGTCGTATTG AAACCTTATT GGAAACTGGT
TTTCTAGAAG GGGTAATTAT TGTCCCACAG TTTATTTTAC AAGAACTGCA ACAAGTAGCG
GATGCCAGTA AAGACCAAAA GCGAGTTAGA GGAAGACGCG GATTAGAAAT CCTCAACCGG
ATTAAAGAGG CTTACCCAGA ACGCATTTTA ATTAACGCGG CTGATTACGA AGATATTGCT
ACAGTTGATG CTAAATTGGT GCGATTTGCC CAGGAAATTA ATGGGACTCT ATTAACTAAT
GACTACAATT TATCTAAAGT TGCTAGTGTG CAGAAAGTTC CAGTTTTAAA TATCAATGAT
TTGGTAAATG CAGTTCGTCC ATCTTATTTA CCTGGTGATA ATCTAGATTT GAAAATTCTC
AAAGAAGGTA AAGAACCAAG TCAAGGTGTT GGTTATTTAG ACGACGGCAC AATGGTAGTA
GTTGAGGAAG GAAGCAGTTA TGTAGGTGGT GAACTGCGGG TAGTTGTCAC CAGTGCTTTA
CAAACCTCAG CGGGGAGGAT GATTTTTGCT AAACCCCAAG CTTCCGCATT AGCGTGA
 
Protein sequence
MLDIIIILSF ILAASGIGYF STDLLPPGTL DGVTNLNALR LIVAVFAAII GGAIGLSFQT 
TYRRLETQVR EMPLEVILTR AIGLVIGLLL ANLMLAPLFL LPIPPDFGFI KPLVAVVGSI
ILAVTGMNLA DTHGRGLLRL INPNTVETMV VEGTLKPANT KVLDTSCIID GRIETLLETG
FLEGVIIVPQ FILQELQQVA DASKDQKRVR GRRGLEILNR IKEAYPERIL INAADYEDIA
TVDAKLVRFA QEINGTLLTN DYNLSKVASV QKVPVLNIND LVNAVRPSYL PGDNLDLKIL
KEGKEPSQGV GYLDDGTMVV VEEGSSYVGG ELRVVVTSAL QTSAGRMIFA KPQASALA