Gene Rcas_3033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3033 
Symbol 
ID5540529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3930825 
End bp3932663 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content62% 
IMG OID640895153 
Productcitrate transporter 
Protein accessionYP_001433106 
Protein GI156742977 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTGA TTGTCTTTGG AGTGCTGGCG CTGACCATAA TCCTGTTCGC CAGTGATCGG 
CTCCGCCTCG ATGTGGTTGC GCTGCTGGCG TTGCTGGCCC TGCTGTTGAC GGGAATCCTC
ACGCCAGCCG AGGCGCTCGC CGGATTTTCC GACCCGATTG TGCTGATCAT CGCCGGGTTG
TTTATCGTCG GCGCCGGATT GTTTCAAACC GGCGTCGCCG ATGCGTTGGG GCAGCAACTG
ATGCGCTTCG CCGGCGCCGG TGAAGCGCGC CTGATCGCCT CACTGATGCT CATTGTTGCG
TTGCTTTCGG CATTTCTCAG TTCAACCGGA ACCGTCGCTG TTTTTCTGCC GGTGGCGGTG
AGTCTGGCGC GACGCGCCGG CGTCAGCCCG GCGAAACTGC TGCTGCCGCT CGCCTATGGT
TCGCTGATCG GGGGCCTGCT GACCCTTATT GGAACGCCAC CCAATATCGT CGTCAGCAAT
CAATTGCAGG CAGCCGGACG CGCGCCGTTC GGGTTCTTTT CCTTTACGCC GATTGGACTC
GTGATGCTGG CGATCGGCAT TGGGTACATG ATTACCGTCG GGCGACACAT GTTGCCCGTG
CGCGCGCACC TTGCGGCTGC GTCAGGAAAC GGCAAACCAA TGGTCGATCC GGCGACATTG
CTGGCATTGT ACGACCTCCC CGGCAAACTG GCGCGCGTGC AGATCGAGCC TTCGTCGCCG
CTGGTTGGTC AAACGCTGGC ACAGGCGAGT TTACGCACCC GCTACCGCAT CAATGTCGTG
GACGTTGAGC CACGTGTTCG CCAGGGGGCA ACTGCCGCGC CACACATCGC CGATTCGGGT
GTAACGCTGC AACCGAGGGA TGTGCTGCTG GTCAAAGGCT CGGCGGAGGA TATTGCGCGG
CTGGCAAGGG AACAGCAAAT GCGCGTGCTG GCGACCGGCG TCAGCCCTGA CGATCTGATC
ACCGAGGAAA CCGGCATTGT GGAACTGGTG CTGACGCCAC GCTCACGCTT GATCGGGAAG
TCGCTGCGCG AGACGCGCTT CCAGGACACC TATCGGGTGC TGGCGCTGGC AATTCTGCGA
TTGGGCGCGC CGCTCGATGC CCCAACATCA CAGGTGGAAC TGCGATTTGG CGACACGTTG
CTGGTTCAGG GAACATGGGA ACGGATCACC TCGCTACTCG ATGAACGCAA CGATTTTGTG
GTTGTCGGCG AGGTGCATCG TCCACCGACA AAACGCGCTC TAACCCGACG CGCCCCCGTT
GCGCTGGCAA TCATGCTGGG CATGCTGATT CTCATTTCGC TCGATATACT GCCCATGGTG
ACCGCCGTGC TGCTTGCCGC CGTCGCTATG GTGCTGACCG GCTGCGTGTC GATGGAAGAA
GGGTATCGGG CGATCAACTG GGAAAGCGTT GTGCTGATTG CCGGAATGCT GCCGATGGCG
ACAGCGCTCG ACAAGACCGG CGGATTGCAA CTGATGGCGA GTGGGTTGAC AGCAACGCTC
GGCGCACTGG GTCCGCTAGC GCTTATGGCG GGGCTGTTCA CGCTCACGGC GCTCTTCAGT
CAGTTCATTT CCAACACTGC AACCACCGTG CTGATGGCGC CGATCGCGTT GCAGGCTGCC
GCAGAACTCG GCGTCTCTCC CTACCCGCTG CTCATGATCG TCGCCATAGC CGCATCGACC
GCCTTCGCCA CCCCAATTGC CTCGCCGGTC AACACGCTGG TGCTCGGACC GGGGGATTAC
CGCTTCACCG ATTTTGTGCG GGTCGGAACG CCGCTGCTCG CGCTGACGTT GATCGCGTCG
CTCGTGGCTG TGCCGGTGGT GTTTCCGCTA TGGTTGTAG
 
Protein sequence
MMLIVFGVLA LTIILFASDR LRLDVVALLA LLALLLTGIL TPAEALAGFS DPIVLIIAGL 
FIVGAGLFQT GVADALGQQL MRFAGAGEAR LIASLMLIVA LLSAFLSSTG TVAVFLPVAV
SLARRAGVSP AKLLLPLAYG SLIGGLLTLI GTPPNIVVSN QLQAAGRAPF GFFSFTPIGL
VMLAIGIGYM ITVGRHMLPV RAHLAAASGN GKPMVDPATL LALYDLPGKL ARVQIEPSSP
LVGQTLAQAS LRTRYRINVV DVEPRVRQGA TAAPHIADSG VTLQPRDVLL VKGSAEDIAR
LAREQQMRVL ATGVSPDDLI TEETGIVELV LTPRSRLIGK SLRETRFQDT YRVLALAILR
LGAPLDAPTS QVELRFGDTL LVQGTWERIT SLLDERNDFV VVGEVHRPPT KRALTRRAPV
ALAIMLGMLI LISLDILPMV TAVLLAAVAM VLTGCVSMEE GYRAINWESV VLIAGMLPMA
TALDKTGGLQ LMASGLTATL GALGPLALMA GLFTLTALFS QFISNTATTV LMAPIALQAA
AELGVSPYPL LMIVAIAAST AFATPIASPV NTLVLGPGDY RFTDFVRVGT PLLALTLIAS
LVAVPVVFPL WL