Gene Rcas_3520 details

Gene Information

Locus tag: Rcas_3520
Symbol:
ID: 5541019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4586180
End bp: 4587322
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 64%
IMG OID: 640895638
Product: hypothetical protein
Protein accession: YP_001433588
Protein GI: 156743459
COG category:
COG ID:
TIGRFAM ID:
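
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record does end in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4586180
END_BP = 4587322
GENE_LENGTH_BP = 1143
PROTEIN_LENGTH_AA = 380


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields listed in the record.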


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTC CTATCAATTC CACCACCTGG AAGCGTCACC GCCGGCTGCT CCTCACTGCA 
CCGCTGGCGC TCGCCGCAGC CGCCGCTGAG TTCTGGGCAG TGCGCGCGCC ACGATCCCTG
CACCTGCCGC CGCAACAACG GGTTGTGACG ATTAACCCGA AGATCGGCAT TCATACGCGC
CTCACTGGCA TCGGCGACGA GGGGTATATC CGGCGCACGC TCGAACAGGT GCGTGAGATG
GGCGCGCGCT GGATCGTTGA TCTCTTCCCT TGGGCATATG TGCAGCCGCG CTCGCGCTTC
GGGTTCGACT GGACCGGCGC CGATCTGGTG GTGCGCCATG CGGCGCGTCA AGGGTTGCAG
GTGATCGCCC GGCTCGACAT TGTGCCGCAG TGGGCGCGCC CGCGCGACTC GAATGACCGC
TACCTCGACG AGGCGCACTA TGCTGATTTT GCCGCATATG CGGCGGCATT CCTTCGGCGC
TACCGCGCCG ATGGCGTGCG CCACATCATC ATCTGGAACG AACCAAACCT GGCATTTGAG
TGGGGACGAC GGACACCCGA TCCGGCCGGC TACGCCGCGT TGCTGAAGGC CGTTTACCCG
CGCGTCAAAT CTGCTGTGCC CGATGCCGTT GTCATTGCCG GAGCGCTCTC ACCCGGCGGC
GATCTTGGCG ACAATGCCGA GGTGCGAATG GGCGATCTGC GCTACATCAC CGAATTGTAC
GCCGCCGGCG CTGCACCCTG GTTCGATGCC TGGGCTGTCC ACAACTATGG CGCGCAGCAA
CCGCACGATG CGCCGCCAGC GCCGGAGGAG GTCAATTTCC GGCGCGTTGA ACTCATCCAC
GACCTGCTCA CCTATCTGGG GGACGGACGC AAACCGATCT TCATTACCGA AGGGGGGTGG
AACGACCACC CGCGCTGGTC AGCGGCGGTG CGCCCGTCGC AACGGGTGCG CTGGACGATT
GGCGCGTACC GGATGGCGCT GGAATGGACC TGGCTGGAAG CAATGTGCCT CTGGCAGTTC
AGCACCCCGT GGCAGGCGCG CACCTATCAG GATAACTGGA ACTTTGTTGC GGCTGATGGA
ACGCCGAAGG CGATCTATTG GGCAGTGCGC GACTATGCCG TGCCGATAGA TCTGCGGCAA
TGA
 
Protein sequence
MNIPINSTTW KRHRRLLLTA PLALAAAAAE FWAVRAPRSL HLPPQQRVVT INPKIGIHTR 
LTGIGDEGYI RRTLEQVREM GARWIVDLFP WAYVQPRSRF GFDWTGADLV VRHAARQGLQ
VIARLDIVPQ WARPRDSNDR YLDEAHYADF AAYAAAFLRR YRADGVRHII IWNEPNLAFE
WGRRTPDPAG YAALLKAVYP RVKSAVPDAV VIAGALSPGG DLGDNAEVRM GDLRYITELY
AAGAAPWFDA WAVHNYGAQQ PHDAPPAPEE VNFRRVELIH DLLTYLGDGR KPIFITEGGW
NDHPRWSAAV RPSQRVRWTI GAYRMALEWT WLEAMCLWQF STPWQARTYQ DNWNFVAADG
TPKAIYWAVR DYAVPIDLRQ
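
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used: the helper names are hypothetical, the codon table is the standard NCBI amino-acid assignment (which translation table 11 shares for internal codons), and only the first 30 bases of the gene sequence are embedded here to keep the example short.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G
# (same assignments as translation table 11 for internal codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons; '*' marks a stop, 'X' an unrecognized codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record.
PREFIX = clean("ATGAATATTC CTATCAATTC CACCACCTGG")

if __name__ == "__main__":
    print(translate(PREFIX))
    print(round(gc_content(PREFIX), 1))
```

Pasting the full gene sequence into `clean` and passing the result to `translate` and `gc_content` allows a comparison with the protein sequence and the GC content field of this record.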