Gene Phep_3070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3070 
Symbol 
ID8254187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3668807 
End bp3670654 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content42% 
IMG OID644936723 
Producttranscription termination factor Rho 
Protein accessionYP_003093329 
Protein GI255532957 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.581675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00476927 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTAATA AAACAGAATT AAATGAAAAA CTCACTGCTG AATTGCGTGA GATAGCAAAA 
AATCAGGGCA TCCTAAATGC CGATGAATTG CGCAAGGCTG AACTGGTCGA AATCATTGCT
CAAATAACCG AGCAGCAAGC CGAAGCAGAA GCTACTCCCA AAGAAGCTCC GGTAAAAGCT
GTAAAAACAA AAGCTGTAAA GGAAAATGCA GCAAAAAACA GCCCTGTTAA GAACAGTGGG
AACAAAGCGC CCGTCCAAAA AGAATTGCCT AATGATGAAA ATACTTTAAC AGCAGATTCT
GCTAAAGCAA CTCCGTTAAA AGAGCGCCCT GTAAAGGAAC ATTTAGCAAA AACTAGCGCA
GCCAATAACC CGTCAAAAGA TGCCCAGGGT GATAAACCGG CTAGAAAGAG AATCCGCATT
GCACCAAAGG AGGCAGCTGA AGAAAATGTA CAAAGGCCAT TTAACCGGGC TTCTTTGTTT
GAGCCCCATC CTGTAGCTGA AGCTTACAAA CAGGAGACAC TTTTTGAAGC ACCTGTTGAA
GTTGTAACTG AAGAAGCTCC GGCAGTAACA CCAGCGGAAA CAACAGTACC TTCTTTTCCA
AACGAAACTA AAAACCCTGC ACATCCGGCA AACCAGGACA ATAAAAAACA GCGGCCACTG
GAAAACAAAC CTAAAAATCA AAACAGCAAC GGCAACCATA AGCAAAACGA AAACAGCTAC
TCCAATCTTG ATTTCGACAA TACCATTACC AACGAAGGTG TTTTGGAAAT TATGCCTGAT
GGATATGGCT TTTTAAGATC TGCCGACTAC AATTACCTTT CTTCGCCGGA TGATATTTAT
GTATCACAAT CGCAGATCAA ATTATTCGGT CTGAAAACAG GTGATACCGT AAAAGGAAGC
ATCCGGCCAC CAAAAGAAGG TGAAAAATAC TTTCCGCTGG TAAGGGTTGA AACCATCAAT
GGCCGTTTGC CCGCTGATGT GCGTGACCGT GTTCCTTTCG ATTACCTTAC CCCATTGTTC
CCTACAGAAA GGTTAAATCT TTTTACAGAA ACAAATAACT ATTCCACCCG TATTATAGAC
CTGTTTACAC CTATAGGTAA AGGACAACGT GGCTTAATTG TAGCCCAGCC TAAAACTGGT
AAAACAAACC TGCTGAAAGA GGTAGCCAAT GCCATTGCTA AAAACCATCC TGAAGTTTAT
CTGATCATTT TATTAATCGA TGAGCGCCCG GAAGAGGTAA CAGACATGGC CAGAAGTGTA
AGGGCTGAAG TAATTGCCTC TACTTTTGAT GAACCGGCAG AACGCCACGT TAAAATCGCC
AATATTGTAC TCGAAAAAGC CAAACGTCTG GTAGAATGTG GCCATGATGT AGTGATCCTG
CTCGATTCCA TTACCAGATT GGCCAGAGCT TACAATACAA CGGCCCCTGC CTCAGGAAAA
ATCTTGTCAG GTGGTGTTGA TGCAAATGCC CTTCACAAAC CTAAGCGTTT CTTTGGTGCA
GCACGTAACA TCGAAAAAGG TGGCTCATTA ACCATACTTG CTACTGCTTT AACAGATACA
GGATCGAAAA TGGATGAAGT GATCTTTGAA GAATTTAAAG GTACTGGTAA TATGGAACTT
CAGTTAGATC GTAAATTATC TAACAAACGT ATTTTCCCTG CTATCGACAT TACAGCTTCA
AGTACACGCC GCGACGATCT GTTGCACGAC AGAGATACTT TGCAGCGGGT ATGGATCCTG
CGCAACCACC TGGCAGACAT GAACGCCCAG GAAGCTATGG AGTTTGTTCA GGCTCAAATA
AAAGGCACCA AATCCAATGA AGAATTCTTA ATTTCAATGA ATAGCTAA
 
Protein sequence
MFNKTELNEK LTAELREIAK NQGILNADEL RKAELVEIIA QITEQQAEAE ATPKEAPVKA 
VKTKAVKENA AKNSPVKNSG NKAPVQKELP NDENTLTADS AKATPLKERP VKEHLAKTSA
ANNPSKDAQG DKPARKRIRI APKEAAEENV QRPFNRASLF EPHPVAEAYK QETLFEAPVE
VVTEEAPAVT PAETTVPSFP NETKNPAHPA NQDNKKQRPL ENKPKNQNSN GNHKQNENSY
SNLDFDNTIT NEGVLEIMPD GYGFLRSADY NYLSSPDDIY VSQSQIKLFG LKTGDTVKGS
IRPPKEGEKY FPLVRVETIN GRLPADVRDR VPFDYLTPLF PTERLNLFTE TNNYSTRIID
LFTPIGKGQR GLIVAQPKTG KTNLLKEVAN AIAKNHPEVY LIILLIDERP EEVTDMARSV
RAEVIASTFD EPAERHVKIA NIVLEKAKRL VECGHDVVIL LDSITRLARA YNTTAPASGK
ILSGGVDANA LHKPKRFFGA ARNIEKGGSL TILATALTDT GSKMDEVIFE EFKGTGNMEL
QLDRKLSNKR IFPAIDITAS STRRDDLLHD RDTLQRVWIL RNHLADMNAQ EAMEFVQAQI
KGTKSNEEFL ISMNS