Gene Nther_2235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2235 
Symbol 
ID6315236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2372195 
End bp2373181 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content43% 
IMG OID642644623 
Productflagellar hook-associated protein 3 
Protein accessionYP_001918389 
Protein GI188586844 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID[TIGR02550] flagellar hook-associated protein 3 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.000208019 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGGTGA CCAATAAAAT GATGAGTGAC AATATGTTGA GGAATCTGAA CACAAATTTG 
CGGGATATGA ATCATACCCA GAACAAGCTT TCCACCGGAA AAAGAATTTT AAGTCCTTCC
GATGATCCAG CGGGAACTGC CAGAACCCTG GACCTTCGGA CTGAGGAAAC TGAACTGGCT
AAATACAAGC AAAATGTTGA TGATGCCGAT TCCTGGTTGA CCAGTACTGA TTCCGCTTTA
GATGAAGTTG ACGATGTTCT ACAAAGGGTT AGAGAACTAA CTATATATGC AGCCAGTGAC
AGTGTGGATC AGCAGTCCAG AGAGGCCCTG GCAGCAGAAG TTATGGAATT AAAGGAACAC
TTAGTGGAAG TTGCCAATAC TGACTTTGGT GGTAAGCACA TTTTTGGAGG GCATAATACT
ACTGATAAAC CCTTTGATAT GGATGACGTT GAAGAAATGA ACGGCGAAGA AGGAGAATAC
AATACCTTTG ATGTAGAGTA CTCCGGAAAT CGAGGACGCC TTAATACCGA CATCAGCTCC
GATGTAACGA TTTCTAAAAA CCTGCACGGT GAAGAAGTTT TTGGCTCTTT CGATGAAGAA
GAAAACGGAG AAAATGGTGA GAACGGTGAT GAGGCCATCT CACAAAATAT GTTTAAAATG
CTGGACGGGG TTTATGATAG CATGATGGAA GATGATGGCG GTGGAACAGA AAAACTCTCC
AACGAACACT TGCAAGATCT CGATCACTGG ATTGAGAACA ACTTGGATAA TAGAGCAGAA
GTCGGAGCAA GACAAAACAG GTTAGAGCTA AGTAAAAACC GCCTACAAGA TATCGAACAT
CTAACCAAAG AAGACTTATC CGAAACGGAA GAAGCGGATA TGGCCAAGAC CATCATGGAC
TTAAAGAGCC AGGAAAACGT GCACCGAATG GCTCTGTCAG CAGGTGCTAG AATAATTCAG
CCCACTTTGT TGGATTTTCT TCAATAG
 
Protein sequence
MRVTNKMMSD NMLRNLNTNL RDMNHTQNKL STGKRILSPS DDPAGTARTL DLRTEETELA 
KYKQNVDDAD SWLTSTDSAL DEVDDVLQRV RELTIYAASD SVDQQSREAL AAEVMELKEH
LVEVANTDFG GKHIFGGHNT TDKPFDMDDV EEMNGEEGEY NTFDVEYSGN RGRLNTDISS
DVTISKNLHG EEVFGSFDEE ENGENGENGD EAISQNMFKM LDGVYDSMME DDGGGTEKLS
NEHLQDLDHW IENNLDNRAE VGARQNRLEL SKNRLQDIEH LTKEDLSETE EADMAKTIMD
LKSQENVHRM ALSAGARIIQ PTLLDFLQ