Gene Rsph17025_4342 details


Gene Information

Locus tag: Rsph17025_4342
Symbol:
ID: 5086518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009431
Strand:
Start bp: 99267
End bp: 101732
Gene Length: 2466 bp
Protein Length: 821 aa
Translation table: 11
GC content: 71%
IMG OID: 640485898
Product: hypothetical protein
Protein accession: YP_001170492
Protein GI: 146280336
COG category: [R] General function prediction only
COG ID: [COG3497] Phage tail sheath protein FI
TIGRFAM ID:
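
The start, end, gene-length and protein-length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
gene_length = gene_length_bp(99267, 101732)
protein_length = protein_length_aa(gene_length)
print(gene_length, protein_length)
```

With the values in this record the sketch reproduces the listed 2466 bp and 821 aa.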


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.161782
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0681295
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCAT ATCTCGCACC CGGCGTCTAT GTCGAGGAGA TCCCCTCGGG CCTCAAGCCG 
ATCGAGGCGG CCGGCACCTC GACGGCGGGG ATGATCGGCA TGACCGCGCG CGGGCCGGTC
AACACGCCCA CGCTGGTGAC GAGCCTCGGG GCCTTCTCGC GCGTGTTCGG GGGGCTGCTC
GATCCGCTGG TCTTCGGCGA GGGGCGCGAC GCGCTGCCCT ATGCGGCCGA GGGCTTTTTC
GTGAACGGCG GCGCGCGGCT CTTTGTCGTG CGGATCGTGG GGGAGGATGC GCGCGAGGCG
GTGCTGGATC TTGAGGCGCG CGATACGGCG GTCGAGGGCG CGCCGGTGCT GGCCGAGGCG
GCGGTGGTGG CGGGCGGGGC CGAGGTCACG CGGCTGGTGC TGGCCGATCC CGGCGCGGTG
GCGCCCGAGG CGCGCCTGCT GATCCTGGAC GGAGAGGCCA GCGAGGTGGT GAGCCTGTCG
GACGCCGAGT TCGTGCCGCA GCTTCTGCTG GCGAGCGGGC TGCGCCAGCC CTTTGCGAGC
GGCGACAGCG TGACCCTTCA GACGGTGACG GCGGCGGATG GCGCGCTGGA TGAAGACATG
GCCGCCGGGG CGGGCGAGAT CACCCTGACC TCGACCGACG GGATCGAGGC GGGCGACACG
ATCCGGCTCG GCGAGGGGGC GACGGCGGAG CTGGTCGAGG TGGGGGCGCT CGACGAGGAC
GGGCCGAGGA TCACGCTGGC GCGCCCGGCT GCGCGGGCCC GCGCGCGGGG CGAGGCGGTC
TTCGTGCTGG AGGATGCGGC CACCCACTCC GCGCTGTCGG CGGATGTGGC GGCGGGCGGA
TCGGGCGCGC TGCTGGCGAT GGACGTCACG GGCTTTTCGG CGGGCGACGT GGTCAGGATC
GCGGGGGACG CGGGCGAGGA ACATGCGATG GTCCGCGGGA TCGCGCAGGC GGTGACGCTG
GCCGAGCCGC TGGCCAACAG CCATCCGGCG GGCGTGCGTC TGGTGGCGGC GCGGCCGGTG
CTGCGGCTTC ATGCGCGCTA CCCCGGCCTC TGGGGCAACG AGCTGCGCGC GACCCTGCTG
CCGGCGAACA TCGTTTCCAC CGTGACGCGG GCGCCCGCCG GCGCGGGCGA GGCGTCTGTG
ACGCTCGGCA GCGCCTTCGG CCTCTTTCCG GGGTCGTTCG TGACGCTGGC GGACCCGGAC
CAGCCTGAAA CGGCGCTGCT TTCCCGCGAG GTGGGCGCCG TCGATACCGC ATCGGGCCGC
GTGACCTTCA CCGAGGCGCA CGGGCAGGCG CTGGCCGCGG GGCTTGTGGT GACGAGCCAG
GAGTTCACCC TCGTCGTCGA GCGGATCGAG AAGGGCAAGG CCGTCGAAAG CGAGACCTTC
GAGCGGCTGG CGATGGCGCC GACCCATCCG CGCCATGTGC TGAAGCTGGT GGGAAGCTGG
GACGCGCTCA CGGGGACGCC CTCGGTCTCG GGCGGGTCCA ACCTCATCCG CATCGAGGAT
CTGTCGGACG CCGAAACCCG CGGCCTGCCG CTGGTCATGG GGGTTTCGCG CCGGATGGCG
GAAGGAAACG ACGATGTCGC GGGCGTCACC GAAGCGACCT ATGTCGGGCA GGCCGCCGAG
GATCCGGCGC GACGCCGCGG CATCTTCGCG ATGGAGAACG AGCCGGCGGT CTCGATCGTC
GCCGTGCCGG GGCAGACCTC GGTGACGGTG CAGAAGGCGC TGGTCGATCA CTGCGAGCAC
ATGCGCTACC GCTTTGCCGT GCTCGACACG CCGATCGGCG CCTCGCTTCA GGGCGCGCGC
GAGCATCGGC AGAACTTCGA CAGCACGCGC TGCGCGGTCT ATTACCCCTC GCTGGAGCGC
GCCGACAGCT TCGGGGCGCC GGGGGACCGG CGGATCGTCG CGCCCTCGGG GCATGTGCTG
GGGATCTATG CGCGCACCGA CACGGCGCGC GGGGTCCACA AGGCGCCGGC CAACGAGGTG
GTGCGGGGCG CGTTGGCCTT TGACGTGAAG CTCGACAAGG GCGCGCAGGA CATCCTCAAC
CCGATCAACC TGAACTGCTT CCGCGACTTC CGTTCGGAAA ACCGGGGCCT GCGCCTTTAC
GGCGCGCGGG TCGCCACCTC GGACCCCGAG TTCAAGTATG TCAACGTCCG CCGCCTGCTT
CTGATGATCG AGCAGTCGCT CGACACCGGG CTGCAATGGG CGGTGTTCGA GCCGAATGAC
AAGCCGCTCT GGGATACGGT GAAACAGTCG GTGACGGGGT TCCTCACCAC CGTCTGGCGG
TCCGGCGCGC TGGAGGGGCA GAAGGCCGAG GAGGCCTTCT TCGTCAACAT CGGCTACAAC
GTGACCATGA CCCAGGACGA CATCGACAAC GGCCGGATGA TCGTCGAGAT CGGCGTCGCG
CCGGTGAAGC CCGCCGAGTT CGTCATCGTC AAGATCAGCC AGAAGACCCG CGAAGCGCAG
GGCTGA
 
Protein sequence
MPAYLAPGVY VEEIPSGLKP IEAAGTSTAG MIGMTARGPV NTPTLVTSLG AFSRVFGGLL 
DPLVFGEGRD ALPYAAEGFF VNGGARLFVV RIVGEDAREA VLDLEARDTA VEGAPVLAEA
AVVAGGAEVT RLVLADPGAV APEARLLILD GEASEVVSLS DAEFVPQLLL ASGLRQPFAS
GDSVTLQTVT AADGALDEDM AAGAGEITLT STDGIEAGDT IRLGEGATAE LVEVGALDED
GPRITLARPA ARARARGEAV FVLEDAATHS ALSADVAAGG SGALLAMDVT GFSAGDVVRI
AGDAGEEHAM VRGIAQAVTL AEPLANSHPA GVRLVAARPV LRLHARYPGL WGNELRATLL
PANIVSTVTR APAGAGEASV TLGSAFGLFP GSFVTLADPD QPETALLSRE VGAVDTASGR
VTFTEAHGQA LAAGLVVTSQ EFTLVVERIE KGKAVESETF ERLAMAPTHP RHVLKLVGSW
DALTGTPSVS GGSNLIRIED LSDAETRGLP LVMGVSRRMA EGNDDVAGVT EATYVGQAAE
DPARRRGIFA MENEPAVSIV AVPGQTSVTV QKALVDHCEH MRYRFAVLDT PIGASLQGAR
EHRQNFDSTR CAVYYPSLER ADSFGAPGDR RIVAPSGHVL GIYARTDTAR GVHKAPANEV
VRGALAFDVK LDKGAQDILN PINLNCFRDF RSENRGLRLY GARVATSDPE FKYVNVRRLL
LMIEQSLDTG LQWAVFEPND KPLWDTVKQS VTGFLTTVWR SGALEGQKAE EAFFVNIGYN
VTMTQDDIDN GRMIVEIGVA PVKPAEFVIV KISQKTREAQ G