Gene RoseRS_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3131 
Symbol 
ID5210101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3931908 
End bp3933107 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content62% 
IMG OID640596723 
ProductNB-ARC domain-containing protein 
Protein accessionYP_001277443 
Protein GI148657238 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00106243 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACAGTG ATCAGCCACA GCAGAACCAG GATATCAGCG GCGCAGGCGC TGCTGGCGTG 
GTCGGTGACG CGACGAATAG CCCAATCACC ACCGGCAACA ACAACACCAT TATTCAGTCT
GGCCGCGATA CTGTACTGTA TACCAATCTC CCGCCACCCC CTGCGCCTGG CAGCGTCCCA
CTGAAGCCTG CCCTCATCAT CGGGCGCGAC AATGATCTCC AGGCGCTGAA GCAACGGGTT
GGCATCTCGG CCAAAGGCGA AAGTCCTGCC CCGTTGAAGG TGCTGACCGC AATACGCGGC
TGGCCGGGTG TCGGCAAGAC CACCCTGGCG GCTGCGCTCG CACACGATCC GGATATCAAC
GCGCGTTTCC CTGATGGAGT GCTGTGGGCC TCGCTTGGCC AACAGCCAGG ACTCTTCGGT
GAACTGGCCG CCTGGGGACG TGCCCTTGGT GTGCCGGATC TTAACCAGGC GCATACCGTT
GAAGAAGCAT CTAACCTGCT GCGCGGCCTG CTGCGCAACA AGCGCTACCT TCTGATTGTC
GATGACGCCT GGCAGGCTGA GCACGTGGTA CCCTTCAATG TCGGCGGCAG TGGTTGCGCG
TTGCTCATCA CCACGCGCCT GCCAGAAGTG GCCCGCGCAA TCGCCCCAAC GCCCGATGAT
GTCTATGTGC TCGGTGTGTT GAGCGAGACC GATGCGCTGG CGCTGCTGCG CACCTTAGCG
CCGACGGTAG TTGCAGAGAA CGAGGCGACC AGCCGCGAAC TGGTCAAGGA TCTGGAAGGT
CTGCCGCTGG CTATTCAGGT GGCCGGTCGG CTGCTCCACA CTGAAGCAAG CTATGGCTTT
GGCGTCGAGC AATTATTGGT CGATATCCGC GAGGGCGCAA GGCTGATCCA GGCTCAGGCG
CCAGCTGATC GCGCCGAAGT TGCCAGAGAG ACAACGCCCA CCGTTGCGGC GCTCCTGAAA
AAGAGCACCG ATCTGCTGGA TGCCCACACG CTCGACTGCT TCGCCTACCT TGGCGCCTTC
GCGCCGAAAC CGGCCACCTT TGACGCTGCG GCGATGCAAG CCGTGTGGGA GGTTGACGAC
CCAAGGCCGA TCATCCGCAC CCTGGTTGAC CGCGGCCTGC TAGAGCCCTC AGGCCAGGGC
CGCTTCTGGA TGCACGCGGT ACTCGTATCT CATGCAAAGT CGTTCTTGAC AGAGGAGTGA
 
Protein sequence
MDSDQPQQNQ DISGAGAAGV VGDATNSPIT TGNNNTIIQS GRDTVLYTNL PPPPAPGSVP 
LKPALIIGRD NDLQALKQRV GISAKGESPA PLKVLTAIRG WPGVGKTTLA AALAHDPDIN
ARFPDGVLWA SLGQQPGLFG ELAAWGRALG VPDLNQAHTV EEASNLLRGL LRNKRYLLIV
DDAWQAEHVV PFNVGGSGCA LLITTRLPEV ARAIAPTPDD VYVLGVLSET DALALLRTLA
PTVVAENEAT SRELVKDLEG LPLAIQVAGR LLHTEASYGF GVEQLLVDIR EGARLIQAQA
PADRAEVARE TTPTVAALLK KSTDLLDAHT LDCFAYLGAF APKPATFDAA AMQAVWEVDD
PRPIIRTLVD RGLLEPSGQG RFWMHAVLVS HAKSFLTEE