Gene Phep_1935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1935 
Symbol 
ID8253039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2236334 
End bp2237494 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content43% 
IMG OID644935586 
Producthypothetical protein 
Protein accessionYP_003092205 
Protein GI255531833 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0240984 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTTA CATTTTACAA AATCAACACA CAATTAATGG CCCTAAAGGC CATAACCTGT 
TTACTTTTAT CCATAGCTGG TTTAAGCGCT TCCGGACAAA ACAAGGCAGC GGGTAAAATC
CATGTCGGTA TAATTTATCC TTTAAGCACC AATGGCAGCC ACGCGGCACT CGACACCAAT
AACCTGTCTA TCCATCTGCT GGCAGGTATT TCGGCATCAG AACAGGGAGC TTCTTTTGCA
GGTATTTCTA ATATCGTACG CAATGGGACC AAAGGATTTC AGTTTGCTGC CTTTTCAAAT
CATATTGGTA AGCAGGTCGA AGGTGGCCTG TTCGCTGGCT TTTTAAATAC CTACGCAGGG
GGCGATGCAT TTGCTGTCGC AGGTTTCAGC AATGTAGCTA CAGCTGACGT TAAAGGCGCG
CAGTTCGCCG GCTTTGCCAA TGTATCCAAA AGCGTAAAAG GCGCACAGTT TGCCGGTTTT
GCCAATATTG CTAAAACTGT AAAAGGGCCT CAGTTTGCAG GTTTTATCAA TTTATCTAAA
AAAGATGCTG CCCTCCAGTT CGCAGGCTTT ATGAATAAAG CTACAGATGT TAAGGGCAGT
CAGCTGGCTG GCTTTATCAA TATCGCAAAA AAAGTTAAAG GGGCCCAGAT AGCCGGCTTT
ATCAATGTGG CCGACAGCAG CGATTATCCC ATCGGGATTA TCAATATTGT AAAAAATGGC
GAAAAAGGCA TTGGCATTAG CACCGATGAA ACACTCACTA CAATGTTGTC TTTCAGGTCT
GGTGGAAAAG TACTTTACGG CATTATCGGT ATAGGTTACA ATTTTAAAAA CACCGATGAA
GTATATGCTT TTGAAGCTGG CCTGGGTGCA CACTTTTTCC AGTCGCCCAC TTTTCGCTTA
AATGCAGAAA TTGCAGGTAC CGGACTAGAA AGTTTCAAGG CAGGCGAATA CTTCAAAACC
TCGTTTAGGT TAATGCCCGC CTTCAAGATC AGTCCTAAAC TTGAAATCTT CGGCGGACCT
TCAGTCAACT ATCTAAACAC CAATACGTTT GAAGGACGCA GCTTAAACAA AAGCTATATC
AATACATGGG AAAACAAATG GGGCAATAAT TTCCAGGCCC TGTACATCGG TTATGGAGGC
GGTATACAAT ACCTTTTTTA A
 
Protein sequence
MNFTFYKINT QLMALKAITC LLLSIAGLSA SGQNKAAGKI HVGIIYPLST NGSHAALDTN 
NLSIHLLAGI SASEQGASFA GISNIVRNGT KGFQFAAFSN HIGKQVEGGL FAGFLNTYAG
GDAFAVAGFS NVATADVKGA QFAGFANVSK SVKGAQFAGF ANIAKTVKGP QFAGFINLSK
KDAALQFAGF MNKATDVKGS QLAGFINIAK KVKGAQIAGF INVADSSDYP IGIINIVKNG
EKGIGISTDE TLTTMLSFRS GGKVLYGIIG IGYNFKNTDE VYAFEAGLGA HFFQSPTFRL
NAEIAGTGLE SFKAGEYFKT SFRLMPAFKI SPKLEIFGGP SVNYLNTNTF EGRSLNKSYI
NTWENKWGNN FQALYIGYGG GIQYLF