Gene Rcas_0069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0069 
Symbol 
ID5537528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp85876 
End bp88785 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content67% 
IMG OID640892235 
Producthypothetical protein 
Protein accessionYP_001430225 
Protein GI156740096 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0314244 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000262699 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCAGG GAGCGATTCG CAACGGCGCC CCCTCGCCGC CTGCGACCCG TCCGCCGCGC 
CCCGACCTGC CATTGCACTT GGTGCGCGCA CTGGCGTTTG CCTGCATTGC CCTGGCGTTG
ATGCGTCCGC TGGCGTGGCC GCTGATCGCG GCATTCCCCT TCGCGTCGCT GACGGTGTTG
CGTCTGATGA ATGATCCGGC GCTGGCAGCG CTGGCGCTGT TGCATGTCGT GCTGGCGCCG
CCAGTCTTTC CGGCGCTGGT CATTGGCGCC CTGGCAGCGC TGTGGGTGCT GTTGGGCGAA
CTCGTCTTCG CCCTGGCAAC TGCGCTTCTT GCACGCTATC GGGCGCAGCG CGCGCTGCGT
GCGCCGCTGT GTCTCCAGAT TCGTCCGACC GCTTCAGCGC GCACCGGAAC CCCTGCGAAA
CCGGGTGCAC TCATGCGATT GATCCACGGG GCTACGGTGG CGCGTTCCTG GATGCACGCC
GCACCCTGGT ATACTCTGCT GGTCAACGGG GCGCCCGACC TGCCTGCCGA ACTGGGGGCG
TTGATCGCCG GTGCATCCGA GGAGCGCCCA CGGACCGTCA CCGCACTGGA TGGCGCAGTG
CGCAGCAGCG TACCGGAAGC GCTGATTCAC GCCGCCGCCG ACCCGCTGCT GGCAGCGGCA
ACGCCGGGGC GTTGGATTGC CTGGCAGCGC TTCGGGCTGG CGCTGCCGCC CGCCTATCCA
CTTCACGCAC CAACCATCGC CATCGAGTCG GAACTGACCG GGGTGTTGCT GGCCGCAGTA
CGTCCGCAGG CCAGCGTGGC GCACGCCGGT CTGGAAGTGG CGTTGCGCCC TCAGGTGGGT
TGGGAGTTGG GTCGGCAGTG GCGCGCGCGC GCGACAACGC TGAAACTGGC GCTCGAACAG
CGTCAGGACT ATGCGTTGTC GCCGGATGTG GCTGCAATCG AGGCAAAATT GGGCGATGCG
GCGTTCGAGG CGACCATTGT GACAACTGCG GTCGCCGATC AGCGCGCTGA TGCCATTGCT
GCGCTGCTCG CCATCGGCGA TGCGCTCGGC GCCTTCCAGC AACGCACCGC CAGTCGTGTG
CAACGTCTCG TTCCGCACGG GCGCATCTCG GTGCGCCGGG TATCTGAAGG GAACGCTGCG
GACACGATTA TCCGCCTGCG CACGCCACGC ATTGCGCCGC CCCCGGCGCT CCTGTTACCG
TTCCGCCTCT GGCGCGGACC GGACGTGCTG ACAGCGGGGG AACTCGGTTA TCTCTGGAAT
CCGTCCGCTC TGCCGGCGAG CGGACTGGTG CGCAGCGATC CGTGCCGTCG GATTGCGGCG
CCGCCGCACG CCTTCTGCGG CGCCGACCCG GAACGCATTG TGGTCGGCTA TGCCTCCCAC
GCTGATGGAC AACGCGCGCC GGTCGGACCG ACGCTGCGCG ATCTGCGCCA GATACTGCAC
CTGACGGCCG GCATGGGCGC CGGGAAGAGC CGGTTGCTGG CGAACCTGTG CCGGCAACTC
GTTCCGCGCG GATTTATGCT GATCGACGGC AAGGGGGACG ACCGGGACGG CAGTCTCGTG
GCAGTGGTGC GTCGGCTCAT CCCGCCGGCG GACGACGCGC GACTGGTGCT GCTCGACCCG
CTCGATACCG CCTGGCCCAT CGGGCTGAAC CCGCTGGCGG GCGTGGATGC CCGGCGTCCG
GGAGGCGCCG ACCTGGCGCT GGGTCAACTC CTTGCAACGT TCGCGCGCAT CGATCCCGGC
GCCTGGGAAC GGTCGCCGGG GATGCAGCAA TTCGCGCGCA TGGCAGCGTT GTTGGTGCTG
GAAGGCGAAG CGCACCCCAC GCTGGCGCAC GTCAAACAGG CGCTCCTCGA CGAAGCGTAC
CGGGAGGAAT TGCTCCAATC GACGCACAAC ATCGAAGTCG CCAGTTTCTG GCGCGAGACG
TACCCGCGCC TGGGAGAAGG GCAACGATCC AGTTGCGATG CGCTGTTGCG GCGATTCGAC
GCCCTGCTCA CCGCAGAAAC GACGCGCTAC CTGGTTGCGC AGGCACAACC GACGCTCGAT
CTGGCGCGCA TGATGGCCGA CCGTATGATC GTGCTCGCGC CCTTGCCCGA TGTGACGCTG
GGTGGTTTGG CGGGCGCAGT GGGAATGCTG ATCGCCCAGG CCTTCGTGCG TGCTGCCTTC
AGTCGGGGCG GCGATGACCA GACTCGCCAC GACTATCCCC TGATCATCGA CGAATTGCAG
GTGTTAATTG GCGCCGGCGA CACAACCGAC ATAGCGACTG CCATCACGCG CCTGCGGTCC
CTGGGCATCC CGACGATCTA CGCCCATCAG GCATTGGCGC AGTTAGGCGA TCTGCGCGAC
CTGATGCTGA TCAATGCCGG GAACCGCATT ATGCTGCAAA CCCAGGAGCC GGATGCCAGC
GTGTATGCGC GCGCCTACGC CGCCAGCGGG CTGACCGCTG CCGACCTGAG TGGGCAACCG
CCGAACGAGC ATCAGTACGC GGTGTTGCGC TGCGGCGGGC TGGTCGCAGG ACCATTTTCG
ATGCAACCGC TGCCCTGGCC GACGGTGGAG GAGGAGGCGC CGCCGCCCTA CGTCGGACCG
GCATGGCGCG ATGTTCTTCC CGATGATGGC GATCCGGCGG ATCGCTTTAT TGCACAGGTG
ATCTACACGG CAGACGACAG CGCCGGATCT GCCCGCGAAC TGGCGCGGCT TGATGAGGCG
GACTGGGAAC GACTGCTGCG GCGCTGGGAG TGCATCCGCG CGCAGCAGCG CCAGTACATT
CTGGCGCATC CCGGTTGCAT TCCTGACCGG CGGGAGCGAC AACGCTGGCT CTCGCGGCTG
TATGCAGCGC GTCCGCGAGT GCTGGCGGCT GCTGAGTACC TGCGCGGACG CCAAAAAGGA
TCGCATCAGA AAGCATTAAG CAAGATTTGA
 
Protein sequence
MKQGAIRNGA PSPPATRPPR PDLPLHLVRA LAFACIALAL MRPLAWPLIA AFPFASLTVL 
RLMNDPALAA LALLHVVLAP PVFPALVIGA LAALWVLLGE LVFALATALL ARYRAQRALR
APLCLQIRPT ASARTGTPAK PGALMRLIHG ATVARSWMHA APWYTLLVNG APDLPAELGA
LIAGASEERP RTVTALDGAV RSSVPEALIH AAADPLLAAA TPGRWIAWQR FGLALPPAYP
LHAPTIAIES ELTGVLLAAV RPQASVAHAG LEVALRPQVG WELGRQWRAR ATTLKLALEQ
RQDYALSPDV AAIEAKLGDA AFEATIVTTA VADQRADAIA ALLAIGDALG AFQQRTASRV
QRLVPHGRIS VRRVSEGNAA DTIIRLRTPR IAPPPALLLP FRLWRGPDVL TAGELGYLWN
PSALPASGLV RSDPCRRIAA PPHAFCGADP ERIVVGYASH ADGQRAPVGP TLRDLRQILH
LTAGMGAGKS RLLANLCRQL VPRGFMLIDG KGDDRDGSLV AVVRRLIPPA DDARLVLLDP
LDTAWPIGLN PLAGVDARRP GGADLALGQL LATFARIDPG AWERSPGMQQ FARMAALLVL
EGEAHPTLAH VKQALLDEAY REELLQSTHN IEVASFWRET YPRLGEGQRS SCDALLRRFD
ALLTAETTRY LVAQAQPTLD LARMMADRMI VLAPLPDVTL GGLAGAVGML IAQAFVRAAF
SRGGDDQTRH DYPLIIDELQ VLIGAGDTTD IATAITRLRS LGIPTIYAHQ ALAQLGDLRD
LMLINAGNRI MLQTQEPDAS VYARAYAASG LTAADLSGQP PNEHQYAVLR CGGLVAGPFS
MQPLPWPTVE EEAPPPYVGP AWRDVLPDDG DPADRFIAQV IYTADDSAGS ARELARLDEA
DWERLLRRWE CIRAQQRQYI LAHPGCIPDR RERQRWLSRL YAARPRVLAA AEYLRGRQKG
SHQKALSKI