Gene Phep_3101 details

Gene Information

Locus tag: Phep_3101
Symbol:
ID: 8254219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3703158
End bp: 3704327
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 42%
IMG OID: 644936755
Product: helix-turn-helix- domain containing protein AraC type
Protein accession: YP_003093360
Protein GI: 255532988
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
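
The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3703158, 3704327)
print(length)                  # 1170
print(protein_length(length))  # 389
```

Both results match the Gene Length (1170 bp) and Protein Length (389 aa) fields.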


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.866891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.240373
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATCC AGAACCTTGA CTATTCCTAT TATCTGTCAT TATTTTGCTG CCTGCACCTT 
TGGTTGCTGT GTTTATTTCT GTTCTTTAAA AAGAACCGAA GCATTGCCGA CCAGATCCTT
GCCTTGTTCC TGCTGGGTTT TTCTTTTATC CATGTGCAGC ACCTTGTGCT TCAAAAAGGG
TACTTAAATG AAATTCCATA CCTGGATCCG GTTATGGGGA TTGTCCTGTC GGCACTGGGC
CCTTTATTTT ATTTTTATGT GAGGGCCATG ACGGGGGAAC GGGAACTGTT AAAAAAATCA
AGACCACACT GGCTTATTTT AATTCCTTCT GTGATAAACC TGATTTTTCT GATGCTTACT
AAAAAAGCAG GGGAATTGCA TAATTATTAT TATGCAGATA CCAGGGGCGA AACAAAATAT
ACACTCGTTA ACCTTTTGCT GCTAATGGGG ATGACGGCTT ACCTGCTCTT TTACCTTATT
GCATCCATAC GTGTACTAAA CCGGCATACC GCTGGCATTA AAGCTTCTTA TTCCAATGTA
AAACGCCTGC AGCTGGGCTG GCTTAAAGAT TTGATTGTCA TATTAATGGT GTTTAGCTGT
ATCATTGCAC CGGTAACTAT CCTCATTGCA GATACCAAGG TGAGCCAGTT GAGCATAGCC
TATTTCAGTA CCTTCATTTA CTTTATCATT GTATACAAGT CATTAAATTA TTCTGTGGTA
TTTGCCCCGT TGGCCCTGCC GGAAGACCTG CCTCTAGCCT CCGAACCGGA GCTGCCACTG
CCGGTTACAG AGCGCTACCA GAAATCGGCC TTATCTCCTG CACAGGTAGA AGAGTATGGG
CATACGATAG AGATATTCCT GGTATCTAAT AAGCTGCTTT TTGATGAAGA CCTGTCGCTC
AGACAGATGG CCGATGTATT AAAACTATCG CCACATGTAT TGTCGGAAGT CATTAACCGT
TATTACAATA AAAGCTTTTT TGATCTGATC AATTCATTCA GGATTGAAGA AGCCAAAAAG
CAGTTAAGGA ATATCAACGA ACTGAACATC ACCATTGAAG GCATTGGTTA TAACTGTGGT
TTCGGGTCAA AAACTACATT TTACAGGGCT TTTAAAAAGC ATACCGGACA TACACCTACA
GGCTATGTGA CCCAAAATAG CATAACCTGA
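
A GC percentage such as the one in the GC content field can be recomputed from a nucleotide sequence printed in this 10-base block layout. The sketch below is a minimal version, assuming the value is the fraction of G and C among unambiguous bases; whether the record's figure was computed over exactly this gene sequence is not stated on the page, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters (e.g. ambiguity codes) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGGCAATCC AGAACCTTGA"))  # 45.0
```

Passing the full gene sequence (with or without the spacing) gives the gene-level value.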
 
Protein sequence
MAIQNLDYSY YLSLFCCLHL WLLCLFLFFK KNRSIADQIL ALFLLGFSFI HVQHLVLQKG 
YLNEIPYLDP VMGIVLSALG PLFYFYVRAM TGERELLKKS RPHWLILIPS VINLIFLMLT
KKAGELHNYY YADTRGETKY TLVNLLLLMG MTAYLLFYLI ASIRVLNRHT AGIKASYSNV
KRLQLGWLKD LIVILMVFSC IIAPVTILIA DTKVSQLSIA YFSTFIYFII VYKSLNYSVV
FAPLALPEDL PLASEPELPL PVTERYQKSA LSPAQVEEYG HTIEIFLVSN KLLFDEDLSL
RQMADVLKLS PHVLSEVINR YYNKSFFDLI NSFRIEEAKK QLRNINELNI TIEGIGYNCG
FGSKTTFYRA FKKHTGHTPT GYVTQNSIT
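
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the database's own pipeline. It builds the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares; table 11 differs only in the set of permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base block layout is accepted.
    Raises KeyError on codons containing characters other than A/C/G/T.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAATCC AGAACCTTGA CTATTCCTAT"))  # MAIQNLDYSY
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (AGC ATA ACC TGA) translate to the closing residues SIT followed by a stop.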