Gene Rcas_0655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0655 
Symbol 
ID5538118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp862054 
End bp863442 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content62% 
IMG OID640892812 
ProductVWA containing CoxE family protein 
Protein accessionYP_001430798 
Protein GI156740669 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAGC GGGTTGTCGA TTTTATCCAT GCCCTGCGCG CAAAAGGAGT GCGCATTTCG 
CTCGCCGAAA GTATCGACGC AATGCGTTGC GTCGAAATTG CTGGTGTCGC CGATAAACAA
TTCTTCCGTT CGGCGCTGCG CGCGTCGCTG GTGAAAGAGC CACGCGACCT GCCGACGTTC
GATGAACTCT TCCCGCGCTT CTTCGGTCTC GATGCCCCGC CGCCGCTCCA GCAGCCCGGC
GGTGGGATGT CCCCCGAAGA GCGCGAACAA CTGGCGCAGA TGCTGCAACA GATGCTGGCA
TCGCTCACAC CAGAGCAACT CCGGCGTCTG TTCGAGATGA TGATGACCGG GAAGGGCATG
AGCGGCAGCC AGATGCGCCA GTTCATCGAC GAGAACACGA CCGCTACACA GATGACGACC
GGCTTCCAAC CGTGGGCAAC GCGTCGCGCG CTGCGTGAAT TACAGTTCGA CCGCCTGGAG
CAGTTGCTTC AGGAACTGAT CGAGAAACTG CGCGAAGCGG GAGTCGGCGA GGCGGCGCTT
CGGCAACTGG AACAGGAAGC GCGCGAGAAC CGCGCGGCGC TGGCGCAGCA GATCGGCAAC
GAAGTGGGGA ATGGGCTGCA ACAGCGCGAA GCCGAAGAAC GCCGTCGCCG CCCTGCCGAA
GATTTGCAGG ATCGTCCATT CGAGGAACTG ACGTACCGGG ACGATGATGA GATGCGCGCC
GTGATTAACC GCCTGGCCGC ACAGTTGCGC ACGCGCGTCG CGCTGCGGCA GAAACGCGCC
AGCAAGGGCG CCCTCGATGC CAAGAGCACC ATCCGCGCCA ATCTGCGCTA CAGCGGCGTG
CCACTCGACA TTCGCCACCG CCGCAAACAC CTGAAGCCGC GCATTACCGT CATCTGCGAT
GTCTCCGGAT CGATGCGCGC TGTCACCGGC TTTATGCTGA TGCTCGTCTA TGCGTTGCAG
GATCAGATCA GCCGCACCCG CCCCTTTGTG TATTACCGCA CCATTGCCGA TGTGCAGGCT
GATTTTCAGC AACTGCGCCC CGAAGATGCG ATCCGCGTCG TGCCGGAGCG GGTGCAGGGT
GGTCCCTGGC AGACGAGCCT GGGGGCATGT CTGGCGACAT TCACGCGCGA TTATCTCGAC
GCGGTTGATC GCCGCACCAC GGTGATCTTC CTCGGCGATG GTGATGATCA TCTGTCGCCG
CCGAACCCGC GCGCGTTCGA GACCATCAAG CGCCGTGCAC ACCGCGTCGT CTGGTTTAAC
CCCGAACCGC CCTATCGCTG GGGGCGGGAA GACAACCACA TGCACATCTA CGCTCCCATG
TGCGATGCGG TGCATCACGT GAGCAACCTG CGTCAGTTGG TTGCGGCTGT GGACGGGCTG
TTTTCGTAA
 
Protein sequence
MDQRVVDFIH ALRAKGVRIS LAESIDAMRC VEIAGVADKQ FFRSALRASL VKEPRDLPTF 
DELFPRFFGL DAPPPLQQPG GGMSPEEREQ LAQMLQQMLA SLTPEQLRRL FEMMMTGKGM
SGSQMRQFID ENTTATQMTT GFQPWATRRA LRELQFDRLE QLLQELIEKL REAGVGEAAL
RQLEQEAREN RAALAQQIGN EVGNGLQQRE AEERRRRPAE DLQDRPFEEL TYRDDDEMRA
VINRLAAQLR TRVALRQKRA SKGALDAKST IRANLRYSGV PLDIRHRRKH LKPRITVICD
VSGSMRAVTG FMLMLVYALQ DQISRTRPFV YYRTIADVQA DFQQLRPEDA IRVVPERVQG
GPWQTSLGAC LATFTRDYLD AVDRRTTVIF LGDGDDHLSP PNPRAFETIK RRAHRVVWFN
PEPPYRWGRE DNHMHIYAPM CDAVHHVSNL RQLVAAVDGL FS