Gene RoseRS_4104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4104 
Symbol 
ID5211087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5143989 
End bp5145401 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content57% 
IMG OID640597692 
Productextracellular ligand-binding receptor 
Protein accessionYP_001278398 
Protein GI148658193 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATG TTCGTCTGCT TACGGTTCTC ATGCTGCTGG CAGCGATAAT CCTCGCAGCC 
TGCGGCGCTC CGGCGGCTAC GCCGCCGACG CAACCGCCTG CGCCGACGCA ACCGCCTGCG
CCGACGCAAC CACCTGCGCC GACGCAACCA CCTGCGCCAA CTCCAGCGCC GACCGAAGCA
ACGGCAGCGG CACGCCTGAC GTGCACTGAA CCGGTGAAGG TGGGATTGAT TACAGATGTC
TCTGGGACTC TTGCCATCTA TGGCGCGCAT ATTCTGCGTT CGTTCATGCT CGGCATGGAG
TATATGGCTG GCGCACCAGG CGCAGCAGGC GAGAAGTTCG ATTTCAATAC TGCCAAAGAG
AATACCTTCA AGGTGGATGA CTGCGAGATT CAGGTCTTTG TGCGCGACGA TGCCAGCAAC
CCGCAGAATA CTGCCACTAT CGCACGTGAA CTGATCGATG TGAACAAGGT CAACATCCTG
GTCGGCACTG TTTCGTCAGG TGCGACGGCA ACGCTTCAGG GAATCGCCTT CGAGAACAAG
ATCCCGCTCA TTGTTGCTCC GGCTGCCGCC AATGACATTA CCGGCAAGGA TTTCAATATC
TATACCTTCC GCGTGAGTCG CAACAACTAT CAGGATGCAA TGAACGTCTG CACGTATCTC
ACCCAGAAGT ACAAGAAATT CGTGCAGATT GCGCCGGATT ACGCCTTTGG TCGCGGATCA
GCGGCAGCTT TCCGCGATGC GTGCGGCAAA CTCGGCGGTG AGTTCGTTGC CGACGATATT
TTTGCACCGG CGGATACCAC CGACTTTACG CCTTATATGG AGCGGATTCT GAACTCAGGC
GCGGAGGCGT ACATCGTCAC CTGGGCTGGT TCTGGCTTTG TACCGCTCTT CCAGGCTGCG
AACGATCTCG GTGTGAACGA TAAGATGAGT CTGGCATCCG CTTTCATCGA TAATGCCCTG
ATGCCGGTCT TCTACGGCGG GGCGGTTGGC TCGACCAGCG GCATTGTTTA CCACTACACC
GCGCCGAAGA ATCCGGCGAA TGATTTCCTC GTCGCCAATA CCAAACCACG CTATGGCGTC
AACCCCGACA TTTTCGACGC CGACGGCATG AATGCCGCCA TCTTGCTGGT CGAAGCTTTA
CGCAAGACGG GTAGCGATGT CGGCAGCGAA GCGCTCATCA AAGCGATGGA AGGCATGTCG
TTTGAAGGAC CGAAGGGAAC GGTTTACATC CGTCCCGAAG ACCATGTGGC CATTCAGGAC
ATGTACATCC TCAAACTGAC CAACCTGACC GATCCGGACG CGAAGTTCTA TGAGTATGTC
GCCACAACTC GCCCGGAGCC GCCCTGTCTG CTGCCCGAAA ATCTCAAGGA ACGATGCGGC
AGCCTTCCCT ATGGCAGCCT GAGCGGTCAG TAA
 
Protein sequence
MNHVRLLTVL MLLAAIILAA CGAPAATPPT QPPAPTQPPA PTQPPAPTQP PAPTPAPTEA 
TAAARLTCTE PVKVGLITDV SGTLAIYGAH ILRSFMLGME YMAGAPGAAG EKFDFNTAKE
NTFKVDDCEI QVFVRDDASN PQNTATIARE LIDVNKVNIL VGTVSSGATA TLQGIAFENK
IPLIVAPAAA NDITGKDFNI YTFRVSRNNY QDAMNVCTYL TQKYKKFVQI APDYAFGRGS
AAAFRDACGK LGGEFVADDI FAPADTTDFT PYMERILNSG AEAYIVTWAG SGFVPLFQAA
NDLGVNDKMS LASAFIDNAL MPVFYGGAVG STSGIVYHYT APKNPANDFL VANTKPRYGV
NPDIFDADGM NAAILLVEAL RKTGSDVGSE ALIKAMEGMS FEGPKGTVYI RPEDHVAIQD
MYILKLTNLT DPDAKFYEYV ATTRPEPPCL LPENLKERCG SLPYGSLSGQ