Gene Aazo_3466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3466 
Symbol 
ID9341270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3537017 
End bp3538141 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content38% 
IMG OID 
ProductTPR repeat-containing protein 
Protein accessionYP_003722214 
Protein GI298492037 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAAAG AATTGCAATC TAATCAGTCT TTATCCAGTC TAATTAAACC ATCATTCTAC 
AGAAATTCTG TACAGAGATT AGCAATTCTG TTTGCTTCCT CTGCATTTTT AATGTTGGCT
TTACCGACAA TTAATTTAAC GGGTAGTAAA TTGTTAGCGC AGAATGCGGT TTCTCAGGAT
TTAGATGCGG CTATATTATA CCAATTGGGA GTGACACGCT ATAACCGCAG AGACTTACAA
AGTGCAGAAT ATGCCTTTCG TCAAGCTTTA CAACGAGATA GTAATATTGG GTTAGCACGG
AATTATTTAG GTAATATTTT GATGGAGCAA AATCGCCTAG ATTTAGCTGT ACAAGAATAT
GGAGAAGCAA TTAGACTGAT TCCCAATTTT GGTGAATTTT ATTATAATTT AGGGTTGGCT
TTGCAAAAAC AGGGACAGAA AGAAGCGGCA ATTGCTGCTT ATCGCCAAGC CTTAGCAGCA
AATCCAAAAA TGGCAGCGGC TCAGTATAAT TTGGGAGTGA TATTGTATGA AGAGGAACGG
TGCCAAGAAG CGATCGCCGC TTACCAGGAA GCTATTAATT TAGACCGAAA CAATGCTAAT
GCTTACTTTA ATTTAGCGAT CGCTCTGCAA CAAGAAGGTC AATTAGAACA AGCGATCGCT
ACTTATCGTC AAATCTTAAA GCTAAATCCT GAAAATACTG TAGCTTACAA TAATTTAGGT
AGTCTCATGG TAATTCAAGG TCAGCCCTCA GAAGCTATTG CCATCTACCA AAAAGCTATT
GGCCAAAATC CCAAAAATGC CTTAGCTTAC TATAACTTAG GAGTAACTTT ATACAATCAA
GGTAATTTAA AAGAAGCTAA TGCCGCATTA AAACGCGCTC GTCAAGAATA TGGTGAACAA
GGAAACACTG AAAGAACCAC CACAATTGAC GACATGATTC AGAAAATTAG CGAGTATTTA
ACACCTAAAA AACCTCAACA CACACAAACC ACAACTCCCA CTCCAAATAA CAGTGATGTA
GTGCTACCAA CACCGGAAAT GCAAACACCG CAAATGCAAA CACCTGAACA ACAAAAGCCA
GAAAAATCTA CTCATGTGCC TAAAACAGTT GAACAAAAAC CATAG
 
Protein sequence
MSKELQSNQS LSSLIKPSFY RNSVQRLAIL FASSAFLMLA LPTINLTGSK LLAQNAVSQD 
LDAAILYQLG VTRYNRRDLQ SAEYAFRQAL QRDSNIGLAR NYLGNILMEQ NRLDLAVQEY
GEAIRLIPNF GEFYYNLGLA LQKQGQKEAA IAAYRQALAA NPKMAAAQYN LGVILYEEER
CQEAIAAYQE AINLDRNNAN AYFNLAIALQ QEGQLEQAIA TYRQILKLNP ENTVAYNNLG
SLMVIQGQPS EAIAIYQKAI GQNPKNALAY YNLGVTLYNQ GNLKEANAAL KRARQEYGEQ
GNTERTTTID DMIQKISEYL TPKKPQHTQT TTPTPNNSDV VLPTPEMQTP QMQTPEQQKP
EKSTHVPKTV EQKP