Gene Rcas_2388 details


Gene Information

Locus tag: Rcas_2388
Symbol:
ID: 5539869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3076177
End bp: 3077223
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 59%
IMG OID: 640894520
Product: NMT1/THI5-like domain-containing protein
Protein accession: YP_001432488
Protein GI: 156742359
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
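
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 3076177
end_bp = 3077223
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1047 0 348
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the listed protein length.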


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.689816
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCATC GCCTGGTTAT CATACTGATG ATCATAGTTG CACTGGCGGC GGCTGGATGC 
GGTGCAACGC CAGCCGCAAC GCCGACTATG CCGGCCGCTG CGCCGCCGCC AACCGAATCG
GCGACCCTGC GCCCAATAGT GATGGGATTC CCGTATATTC CGAATGTGCA ATTCGCCCAT
TTCTACCTGG CGGATGCGAA AGGGTACTAT GAAGCCGAAG GATTGGACGT CGCCTTCGAT
TACAATTTTG AGACCGATGT GGTGCAGCGC GTGGCGCAGG GAACATTGCA GTTCGCGCTG
GCGTCGGGCG ATTCGGTGCT GCTGGCGCGT TCGCAAGGTT TGCCGATTGT CACAGTGATG
ACGAATAGCC AGCGCTTCCC GACGGTGCTT TTCAGCAAAG CGGAAGCGAA CATCACTACG
CCAAAGGACC TGACGCGCGA CGGGGTGACG GTTGGCATTC CAGGGCGCTT CGGCGCCAGC
TGGATCGGTT TGCTGGCGTT GCTCTACGCT GAGAACATCC CGCGAGAAGC GGTCAACGTT
CAAGAGATCG GTTTCACGCA GGTGGCGGCG ATCACCGAGG GGAAAGTGAC GGTTGCAACC
GGGTACGGCA ACAACGAGCC GATTCAACTG GAGCGGCAGG GCATTCCGGT GAATGTCATC
CGTATCGCCG ATTATTTCCC GCTGGCATCC GACGGGCTGA TTACCGGTGA GCAACTCGTT
GCCGGCGATC CCGACGTGGT GCGCAAGTTC GTGCGGGCAA CCCTGCGTGG CATGGCGGAT
GTGATCGCCG ACCCTGACGC TGCATTCACC ACTGCTCTCG ATTACATCCC CGAACTCAAG
GGCGCCGATC AATCGACGCA GGACCTTCAG CGCGCCGTGC TCCAGGCGAC GCTCGACTAC
TGGCAGAGCG ACAAAACGAA GACCGAGGGG CTGGGGTTCT GCGATGAAAC GAACTGGCGC
GAAACCTACG TCTTCCTGCG TGAGAGCGGT CTGCTGGCGA CCGATGTGGA CGTAACGAAG
GCATTTACCA ATCAGTTCAT CAAGTAG
 
Protein sequence
MMHRLVIILM IIVALAAAGC GATPAATPTM PAAAPPPTES ATLRPIVMGF PYIPNVQFAH 
FYLADAKGYY EAEGLDVAFD YNFETDVVQR VAQGTLQFAL ASGDSVLLAR SQGLPIVTVM
TNSQRFPTVL FSKAEANITT PKDLTRDGVT VGIPGRFGAS WIGLLALLYA ENIPREAVNV
QEIGFTQVAA ITEGKVTVAT GYGNNEPIQL ERQGIPVNVI RIADYFPLAS DGLITGEQLV
AGDPDVVRKF VRATLRGMAD VIADPDAAFT TALDYIPELK GADQSTQDLQ RAVLQATLDY
WQSDKTKTEG LGFCDETNWR ETYVFLRESG LLATDVDVTK AFTNQFIK
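
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks might be cleaned, checked for GC content, and translated is given below. It is not the tool that produced this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, which is assumed to be sufficient here because translation table 11 uses the same codon-to-amino-acid assignments and differs mainly in which codons may act as start codons (this gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed adequate for table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATGCATC GCCTGGTTAT CATACTGATG ATCATAGTTG CACTGGCGGC GGCTGGATGC"
last_line = "GCATTTACCA ATCAGTTCAT CAAGTAG"

print(translate(clean(first_line)))  # MMHRLVIILMIIVALAAAGC
print(translate(clean(last_line)))   # AFTNQFIK
```

The two printed fragments agree with the first 20 and last 8 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a figure to compare with the listed GC content.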