Gene RSP_2997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2997 
Symbol 
ID3720248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp1693482 
End bp1694708 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content50% 
IMG OID640071191 
Productputative head portal protein 
Protein accessionYP_353064 
Protein GI77463560 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.429219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGGAT TTGGCAAACG AGAGTCAGGC AGCAATCAGC CGACCGTTAT TAGTAGGATC 
TCTGAGGCGT TCGGCTGGTG GGGCGGATCG TCTTCTATTG CCCCGGCACT AAGCAATACA
ACTGCGATGC AGAACCCGGC TGTTATGTGC GCAGTCCGAA CAATCGCGGA AGGTGTAGCT
TCCATGCCTA TCAATATTAT CGAGACAAAA GAAGTAGACG GGCTGTCAAA GCGAACAATT
CGGAAAGATC ATTGGGCGTC AAAGCTGATT AATAAGCCAA ATGCCTATCA GACCCGATTT
GAATTTGTTG AAATGATGAT TTCAAATGCC GTGCTCGGAA AAGGCGCATT GGCACTCAAA
ACCGTTGTCG GTGGAGAAGT CCGCGAACTC TTGCCTATCC CTAGCGGTAT TTGGGAAATG
GAAATCCTCA CTAATGGATC ATACAATTTC CGGGTAAGGT TTACCGATGG TTCCAGCCGC
GTATTCGCAG CTAAGGATTG TCTATTCTTC CGTGGTTTGT CGCTTGACGG GTATTCGTCT
ATCTCCGCTA TTGAGACCGC CAGAAAGGCT GTCGGTATCG CGAACGCCCT TGAAGGCCAG
ACTCTTCAGA CGGCTTCGAA TGGTGGAAGA CCTTCAGGTG TCTTGAGCAT CGGTGATCCA
GAAGACGGCG TTGCTCTGGA TGAAGATACC CGTGCCAAAA TCATCGCACT TTGGAAGGAC
CGATTCTCAT CGAATGGGGA AGGCGGTATC CTGATTTCAT CTGGATATTC GACCGACTTC
AAACCGATCC AACAGAACGC GGTTGATAGC CAACTTATCG AAAGCCGCAA GTATCAGGTC
GAAGAGATAG CTCGCATCTT CCGGGTGCAT CCGGCTTATC TGATGGCGTC CGGGACTATC
ACTCCCGAGA TCCAACGGGC GCATGTCCGC AATACCCTCA TGCCTTGGGT AGCTCGTTTT
GAACAAGCAT TAGCAGCGTC ACTGCTCCAA GCCGAACCAA ATCTGTTGTT TGATTTTGAT
GAGCACGAAT TACTTCGCGG GGACCATTCT GCCCTAAAAG ATTTCTTCGC ATCAGTGACG
GGCGTTGGTG GAAGTCCTGC AATCATGTCG GTCAACGAAT GCCGTTATGA ATTGGGCCTT
GATCCTATTG CGGATGAATG GGCCAGAACT CCGCTCAAAG GCGGGTATGA AAACTCCGCT
ATTCAGAAAG AGGAAAGCAG CAAATGA
 
Protein sequence
MFGFGKRESG SNQPTVISRI SEAFGWWGGS SSIAPALSNT TAMQNPAVMC AVRTIAEGVA 
SMPINIIETK EVDGLSKRTI RKDHWASKLI NKPNAYQTRF EFVEMMISNA VLGKGALALK
TVVGGEVREL LPIPSGIWEM EILTNGSYNF RVRFTDGSSR VFAAKDCLFF RGLSLDGYSS
ISAIETARKA VGIANALEGQ TLQTASNGGR PSGVLSIGDP EDGVALDEDT RAKIIALWKD
RFSSNGEGGI LISSGYSTDF KPIQQNAVDS QLIESRKYQV EEIARIFRVH PAYLMASGTI
TPEIQRAHVR NTLMPWVARF EQALAASLLQ AEPNLLFDFD EHELLRGDHS ALKDFFASVT
GVGGSPAIMS VNECRYELGL DPIADEWART PLKGGYENSA IQKEESSK