Gene Rcas_2018 details

Gene Information

Locus tag: Rcas_2018
Symbol:
ID: 5539496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2587963
End bp: 2589057
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 58%
IMG OID: 640894153
Product: PilT domain-containing protein
Protein accession: YP_001432124
Protein GI: 156741995
COG category: [R] General function prediction only
COG ID: [COG4956] Integral membrane protein (PIN domain superfamily)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.625918
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATCA GTCTTAATTT TATCGTTCGC CTGATCGGGA TGTTCGCGCT GGGGTATGCA 
GGTTTTCGCA TTGGCATCAC GCTATCGGGT GAACCTCCAA CCGAGATTGA GGTCCTGGCG
ACCCAGTTGC TGGCGCTGGC AGGCGCCGGC TTGGGTCTGT TGACCACCCA CCGCTGGACG
GTCGAGCCGG TTCGTGATCT GATTCGTCAT ATGCGAAGCG TCTCGATTGC CGAATTGACG
GCGCTGGTAT TCGGGGCGCT GGTTGGGCTG ATTTTTGGCG TGCTCTTGTC TGTTCCACTG
GCGCAACTGC CGCCGCCGCT CGGTCAGTTT GGTCCAATCG TCGCTGCCGG TGCGCTGGCG
TATCTTGGTG CGATCCTCTT CTCCAACCGT AAGAAAGATA TTGCGGATAT GCTCCTCGCC
TCGCGCCGCG GAACCTTCTC CTGGTCGCAA CAGGTCGGCG ATGCTATCCA GCCGCCGCGT
CGGTATCTGA TTGATACCTC AGCCATTGTT GATGGACGCA TTGCGGCTGT AGCGCAGACG
GGCTTTGTCG ATGGAACATT GGTCGTGCCC GACTTCGTGT TGCACGAGTT GCAATCGCTG
GCGGACTCTG CCGATGAACT GCGGCGGATG AAAGGGCGGC GCGGGCTTGA GATTCTGAAC
ACGATGCAGA AACAAATGAA TAGCGCCGTG GAAGTGCTGA ACGCCGACAT TCCCGGCACT
ATGGACGTGG ACGAGAAACT CGTCATTCTG GCGCGCCAGT ATCGCTGCCC GATTATTACG
AATGATAATA ACCTTGGGCG CGTTGCGGAA CTCCAGGGGG TCAGGGTTCT GAGCCTGAAC
CATCTGGCAG ACGCCGTTCG TCCGCCGGTC ATTCCCGGTC AGGACCTGCG TGTGACAATC
CGCGATATTG GGCGTGAGCG TGAACAGGGG ATTTCGTTTC TGGAAGATGG CACGATGGTG
GTGGTCGAAG ACGCGCGGCG TCTGATTGGG CGCGAGGTGG ATACGATTGT CACGCGCGTC
TATCAGACGC AGACCGGTCG GATCGTGTTT GCACAACTGC GGCTGGAGAA TGTGGTGAAG
CAGGGACCGG TCTGA
 
Protein sequence
MRISLNFIVR LIGMFALGYA GFRIGITLSG EPPTEIEVLA TQLLALAGAG LGLLTTHRWT 
VEPVRDLIRH MRSVSIAELT ALVFGALVGL IFGVLLSVPL AQLPPPLGQF GPIVAAGALA
YLGAILFSNR KKDIADMLLA SRRGTFSWSQ QVGDAIQPPR RYLIDTSAIV DGRIAAVAQT
GFVDGTLVVP DFVLHELQSL ADSADELRRM KGRRGLEILN TMQKQMNSAV EVLNADIPGT
MDVDEKLVIL ARQYRCPIIT NDNNLGRVAE LQGVRVLSLN HLADAVRPPV IPGQDLRVTI
RDIGREREQG ISFLEDGTMV VVEDARRLIG REVDTIVTRV YQTQTGRIVF AQLRLENVVK
QGPV
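
The gene and protein lengths in this record are consistent with each other: 1095 bp is 365 codons, and the final codon (TGA) is a stop, which leaves 364 amino acids. The Python sketch below illustrates how the gene sequence maps to the protein sequence under translation table 11. It is a minimal illustration and not the tool that produced this record. The helper names (clean_sequence, gc_fraction, translate) are hypothetical, and the sketch assumes the sequence starts with ATG and contains only A, C, G and T.

```python
from itertools import product

# NCBI translation table 11, written in the conventional T, C, A, G codon order.
# Assumption: only the standard codon assignments are needed, because the gene
# in this record starts with ATG (alternative start codons are not handled).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = clean_sequence(
    "ATGAGAATCA GTCTTAATTT TATCGTTCGC CTGATCGGGA TGTTCGCGCT GGGGTATGCA"
)
print(translate(first_line))  # MRISLNFIVRLIGMFALGYA
```

Passing the full gene sequence through clean_sequence and translate should reproduce the protein sequence listed above, and gc_fraction can be compared against the GC content field.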