Gene Rcas_0110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0110 
Symbol 
ID5537569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp131280 
End bp132617 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content63% 
IMG OID640892274 
ProductABC transporter related 
Protein accessionYP_001430264 
Protein GI156740135 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA CGTTTACACC AAAGACGACA GCGAGCGTCG GAAAAGTCAT CCTGAGCGCC 
CGCAATCTGC ATAAAATCTT CAAGACCCCC GAAGGAAACG ATCTGCTGGT TCTGGATACC
ATCAATCTGG ACCTGGCAGA AGGCGAGATT GTCGCACTGC TGGGTCGCTC CGGCTCTGGC
AAGTCGACGT TGCTGCGCTG CCTGATCGGG CTGATCTCCC CGTCGAGCGG CGAGGTGCGC
TACCGCAACC GGTTGGTGAC CGGACCGATG CCGGGGATGG CGATGGTCTT CCAGTCGTTT
GCTCTCTTTC CCTGGCTGAC GGTGCTGGAG AACGTCGAAC TCGGCCTGGA GATGCAGGGC
GTCCCGGAAG GCGAACGGCG GCGGCGGGCG CTGGGGGCGA TTGATCTGAT CGGTCTCGAC
GGATTCGAAA GCGCATATCC GAAGGAGTTG TCCGGTGGCA TGCGCCAGCG AGTCGGGTTT
GCCCGCGCCC TGGTGACCAA CCCCGACGTG CTGTTGATGG ACGAGCCGTT CTCGGCGCTC
GATGTGCTGA CTGCCGAGAA CCTGCGCGCC GAGTTGCTCG ATCTGTGGGA AGAGCGACGC
ATTCCGACGC GGGCAATCCT GATGGTGACG CATAACATCG ATGAGGCGGT CCTGATGGCT
GATCGGGTGC TCATCCTCAG TTCCAATCCG GGACGGATCA TCAGCAGCCA GCAGATCGAC
CTGCCGCGCC CGCGTGATCG CAATGATCCC GCGTTTCTTG CCGCCGTCGA GACGATCTAC
CGGGCGATGA CGACGCCGGA GGCGGCATTT GCCGGCGCCG CAGCGGTTGA GATTGATCAT
GCAAGCCTTG GTATGCGGCT CCCGGAAGCG GAGGTGGCGC AGATAATCGG GTTGATCGAA
CGAGTGGAAG CCGGTCCGGA TCGCGGACGT GACGATCTGC CGGCGCTGGT GGCGGAGATG
CAACTCGATG CGGACGATCT CTTCGTCATC ACCGATGCGG CTGAGTTGCT GGGCTTCGCG
CAGACCCGCG AAGGGGACAT CACCCTGCTA CCGGAAGGCG TGAAACTGGC GCGGAGCGAC
ATCCAGGAGC GCAAGGTCAT TTTTGCCGAG CACCTGATGA ATCGTGTGCC GCTGGTGGCG
CATATCCGGC GCGTGCTCGC CACCCGGCCC GACCATCGTG CGCCGCGTGA GCGGTTCCTT
ACGGAACTGG AAGATTTCAT GGGGGCTGAA GAGGCAGCGC GCACGCTCGA TACGGCAATT
GAATGGGGGC GCTACGCCGA ACTCTTCGAG TACGACGCGC GCGAGGGGCG GCTGCGCCTG
CCGGAGAATG GCGGGTGA
 
Protein sequence
MATTFTPKTT ASVGKVILSA RNLHKIFKTP EGNDLLVLDT INLDLAEGEI VALLGRSGSG 
KSTLLRCLIG LISPSSGEVR YRNRLVTGPM PGMAMVFQSF ALFPWLTVLE NVELGLEMQG
VPEGERRRRA LGAIDLIGLD GFESAYPKEL SGGMRQRVGF ARALVTNPDV LLMDEPFSAL
DVLTAENLRA ELLDLWEERR IPTRAILMVT HNIDEAVLMA DRVLILSSNP GRIISSQQID
LPRPRDRNDP AFLAAVETIY RAMTTPEAAF AGAAAVEIDH ASLGMRLPEA EVAQIIGLIE
RVEAGPDRGR DDLPALVAEM QLDADDLFVI TDAAELLGFA QTREGDITLL PEGVKLARSD
IQERKVIFAE HLMNRVPLVA HIRRVLATRP DHRAPRERFL TELEDFMGAE EAARTLDTAI
EWGRYAELFE YDAREGRLRL PENGG