Gene Phep_3366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3366 
Symbol 
ID8254485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3994458 
End bp3996059 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content43% 
IMG OID644937018 
Producthypothetical protein 
Protein accessionYP_003093622 
Protein GI255533250 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAC AAATTATAAT TATAGCGCTT ATAGCGTTTG GCTTATTGTC TTGTACCAAA 
AATTTTGAAG AAATCAATAC AGATCCGAAC AAACCTGTGA AAGTGCAACC GGATTTTTTA
CTGACCACTT CCATTTTTGA AACAATGAAC CTGTTTGGCG GCAATATGAA CCGTGTGGTA
TTTTTTAACT ATACACACCA TTTCTCTGGC TTTCAGGGTA ATTTTCAGCG CTACAACTAC
GACCTGAACG AGGACAATAC TTATTGGCGC GCAGTTTATG TACAAGCCGC CCAACCTGTA
AACCAGATTA TTGTAAATTA TAAAAATGAT CCTGCTTATA CCAACAGGGT GATGATTGCG
CGCATATGGA AAAATTATAT CCTATCCAAT GCAGTGGTGA TATGGGGAAG TGTTCCTACG
GAAGGTGCCT TGCTGGGTAC GCCAAGTGTT CCTTATACCA AAGAACAGGA TGTATATGTG
AACGTATTGG CTGATCTTAA AAATTTAACG GATTCATTAA GCCTTACCGG TGATAAATAT
ACTGTAAATG CAGATAAGAT TTTTGGCGGT GACCTTTTAA AATGGAAGAA ATTTGCAAAT
ACACTGAGGT TAAGACTGGC CATCCGGATT TCTAACGATG CGCCCAATGG CGATCCTGTG
TTAGCCAAAA GGGTAGTAGA AGAAGTTTTT CAAACGGAAC AGTATACGAT GAAGGCCCAG
ACTGAAACGG CTGCGGCAAA CTGGGGCACA ACCAGTGATA CCTGGAGTCC GCTTTACGAC
AGGGCGGTAT ACAATTATAC TGCCAACAAG GCCACTATTC CGGTTACCAA TGAGTCGCTG
GTTTACCACA TGGCGCCTTA CAACGATGCA CGGTTAACTA TATATGCACA GCCGGCGAAA
CAAGGCCCGC AAACCGGAAC TTACTTTGGA CAGAACATCT CTTATGGAGG GGGATCAACT
TATGCTAACG GTTTAACCAA TCCGCATACC GGCTTAAAAC AAGATGACTA TTCGGCTATA
GGTGAGCGGT TTTTAAAGCC CGATGCGGAA TATGTGTTCC TTTCATATGC TGAAGCTTGT
TTTTTAAAGG CGGAAGCTGC GTTAAAGGGA TGGTGGGGCA ACCCAAATGC TTCGCAGTAT
TATTATGAAG GTATAGATGC TTCTTTTAAC AGGTATGGCC TTACTGTAAC ACAGGCAAGC
AATTATAAAA ATACACCCGG CATTAAGTGG AGTACGGCAT CTGATACCGT TGGCAGGAGC
GCCCAATTTA AAGACTGGTT GCAGATCTGT TCCAGTTATA TTCCTGCAGG TGATAATTTG
CGTCAGATTG TGATGCAACA TTGGCTGGCC ATCCCGGGGC AGGGCGTAGA TGCCTGGACA
CTGATCAGAA GGACCAGGTT GCTTGAATTT CAGCCGCAAT TTGCTACCTA TGATGGTACT
TACGCCTATG TGCCTAACCG TTTGCCATAT CCCTCAGATG AATTGCAGAC CAATATTGGA
GAAGTAAATA AAGCCATTGG CTGGCTGGGC GGTGCCGATA ACCTTAACAC GAAGTTATGG
TTTGCGTTGC CCGTTAAGAA AAATCCTTTT TTACCATTTT AA
 
Protein sequence
MKQQIIIIAL IAFGLLSCTK NFEEINTDPN KPVKVQPDFL LTTSIFETMN LFGGNMNRVV 
FFNYTHHFSG FQGNFQRYNY DLNEDNTYWR AVYVQAAQPV NQIIVNYKND PAYTNRVMIA
RIWKNYILSN AVVIWGSVPT EGALLGTPSV PYTKEQDVYV NVLADLKNLT DSLSLTGDKY
TVNADKIFGG DLLKWKKFAN TLRLRLAIRI SNDAPNGDPV LAKRVVEEVF QTEQYTMKAQ
TETAAANWGT TSDTWSPLYD RAVYNYTANK ATIPVTNESL VYHMAPYNDA RLTIYAQPAK
QGPQTGTYFG QNISYGGGST YANGLTNPHT GLKQDDYSAI GERFLKPDAE YVFLSYAEAC
FLKAEAALKG WWGNPNASQY YYEGIDASFN RYGLTVTQAS NYKNTPGIKW STASDTVGRS
AQFKDWLQIC SSYIPAGDNL RQIVMQHWLA IPGQGVDAWT LIRRTRLLEF QPQFATYDGT
YAYVPNRLPY PSDELQTNIG EVNKAIGWLG GADNLNTKLW FALPVKKNPF LPF