Gene Rcas_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1996 
Symbol 
ID5539474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2562894 
End bp2564477 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content62% 
IMG OID640894131 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_001432102 
Protein GI156741973 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.177975 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGGACG ATCATCAACG ACGTTCTATC CGTGGCACAA CGACGCGTGA GCGCATTGCA 
GCACGACGGC GGGCGCGTCG CGCCAGCCAA TGGTCTGGGT TCGCGCTGGC ACTCGTGGCG
CTTGCCGTGC TCGCGTTGTT GACCATCGGA GTGACGGCGC TGCGACGCGC CGAACGAACT
CTGGCAGAAC TCGAACAGAA TGATCCACGT CAGCGCGCGA CTGCCACCTC ACCATCCCCG
ACCACGGCGC CACAAACTCC GGCTTCCAGC ACGGTAGCAT CGCCGGCGCC TGCGGAATAC
GGGTCAGCGA ACGACTTGCT TGCCCGACCA TTTACGGTGC TGCTCCTCGG CGTCGACCGC
CGCACCGACC CCGACGAAGG GGTGCGCAGT GATACCCTGA TGCTCGTGCG TGTCGATCCA
CAGGCGCGCA CCATCAGTAT GCTCTCAATC CCACGTGACT CGGTCGTGCC TGTCCCACGC
CTTGGTTGGG CGAAGATCAA CGCGGCATAT GGGTATGGGT ACGCCAACGC TGCAACGCTC
TATGGCAACA GCGTCGAACC ATCGGCTGCT GGCGGCGCAC TCGCAGCCGA AGCGGTCGAA
CAGTTTCTCG GTGTTACCAT CGATTATATT GCGCAGGTCG ATTTTCGTGG TTTCGAGCGC
CTGGTCGATT CGGTCGGCGG CGTCTTGATC GATGTGCCCG CCCCTGTTCT CGATGCAGAG
TATCCGACCG AAAACTACGG CGTCGAGCGG ATCTACATCC CGGCAGGACT GCAAGTGTTC
GATGGACGCA CAGCGTTAAT CTACGCTCGC ACGCGCCACG GCAGCAGTGA TTTCGAGCGC
AGTAAACGTC AGCAGCAGGT GTTGCGCGCG CTTTTCGATC AGGTGCGCAA CCGCGGGCTG
CTGGCGAATA CGACCCTCCT GCCACGCTGG ATTGAGGTGC TCACCCAACA CGTGCGCACG
ACATTACCGG TGTCCGACCT TCGGGCAATG GCAGAACTTG CAGCGCTGGC GCGCGACCTG
GACAGCAGTC GCATTCTGCA ACTGTCGATC AACCCAAATG ATGTGGCCAT CGACCGTGAA
GACGGCTCAG ACATTTACTG GAATCCGAAC GATATTGCGG CGCTGGTGGC GCGCTGGGAA
GCCGGTCCCG ACCTGACCGC CACGCCGCCG ATTGCGGGTC TGCCGACCCA GGTTGCTGTA
CCCGACGCAT CGCCGGATGA TCCGTTGCAC GGCTCGGTGC CAACGCTTCT CCCATCAGTC
GTCGAGCCGG GCGCCGTTGT TCAGGTGCTC AATGGCGCGC GCGTCGAGGG GATTGCCGGG
CGCGTCAGCG CATTTCTGGA AAAGCGCGGA TTCGTCGTCG CCGACCCGGA AACAGTCACC
CGCGTCTACG AGCATACCCT GATTATCGAC TATACCGGAC GCCCCGAAAC ACGTCGTCTT
CTGGCAGAGG CGCTCGGTGT CAATGCTCGC TATGTCCTGG CGCCCGCCCC GCCTGATGCG
CCGCCGCCGG GGTATAATGT CGATATTGTC GTCATTGTCG GGCGCGATTA TCGGCAGGAA
TGGCTGAACG ACGGCGGGAG GTGA
 
Protein sequence
MADDHQRRSI RGTTTRERIA ARRRARRASQ WSGFALALVA LAVLALLTIG VTALRRAERT 
LAELEQNDPR QRATATSPSP TTAPQTPASS TVASPAPAEY GSANDLLARP FTVLLLGVDR
RTDPDEGVRS DTLMLVRVDP QARTISMLSI PRDSVVPVPR LGWAKINAAY GYGYANAATL
YGNSVEPSAA GGALAAEAVE QFLGVTIDYI AQVDFRGFER LVDSVGGVLI DVPAPVLDAE
YPTENYGVER IYIPAGLQVF DGRTALIYAR TRHGSSDFER SKRQQQVLRA LFDQVRNRGL
LANTTLLPRW IEVLTQHVRT TLPVSDLRAM AELAALARDL DSSRILQLSI NPNDVAIDRE
DGSDIYWNPN DIAALVARWE AGPDLTATPP IAGLPTQVAV PDASPDDPLH GSVPTLLPSV
VEPGAVVQVL NGARVEGIAG RVSAFLEKRG FVVADPETVT RVYEHTLIID YTGRPETRRL
LAEALGVNAR YVLAPAPPDA PPPGYNVDIV VIVGRDYRQE WLNDGGR