Gene Cpin_4147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4147 
Symbol 
ID8360320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5174052 
End bp5175368 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content43% 
IMG OID644966318 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003123807 
Protein GI256423154 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0286987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.311664 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGTA CTGGTTTTCA TCAAGATAAC ATTGTGATAA CATTGTTTTT ATATTTTAGC 
AACACGAGCA CACATATGAT CTATGTTATT GCAATAGGGA CTTTCCAGGC ATTTACAGCC
GTTCTATTAC TATGGACGAA CAGATTGAAA AGTAAGGCGG ACGCTTTGCT TATTTTGCTG
TTGTTATGTA TTGCCGCCCA TCTGGCGACA AAATTCTATA TCTATACTGT TGTCAGTGAT
GCGCATATCC GGTTACAGAT GAACACCTTC ATTGGTTTCT GTTATGGTCC GCTGCTTTAT
CTATATACAC TTAAAAATAA AGATGAATCG TTCATTCCTG CGTCGCGTTG GTATGTGTTC
ATTCCGTTTA TATTAGGTGC GATCGGTTAT CTGACTGTCG TATGTGTACT GGAATTCTCT
TTACAGGCGG GATATGCGGC TTTGCTGGTC TATAATCAGA TTTCGACCTG GACGATGTTA
GCTGCCGGCG CGTTTTTTCC TATGCTGACG CTCCGGGTCG CGCGAAAAAA TCTGCGCAAT
AAACCACAGG AGTTACAGCT GATAGAATGG ATCTCTTATT GTTTGTTAGC GATCACCGTT
GTTTCACTTA TTTTTCAGGG TATCAATGCA TTGCACCTGT TAGGATATCA GGACCAGATC
TTTTGCAGGG ACATTATATA TTCGATCCTG CTGGTGGTGT GTTTTATCAT TATCCGTTAT
AAATATGTGG CAGTCGTTCC GCCGGCGATG TATGTGGAAA CAGTTGTAAT ACCAGCCATT
CAGGAAGAGA TACCTGCAGA GAAGGCAAAT GTGATCGATA TACCGGAAGC TGTAATGGAA
ATTGAACCAT TACCGGCGCA TGTTGCGATA GTACAGGATT CGGCGATAGA CGATGAAGAT
ATCAGTGCAC AGTCCTCTCC TGTTCGCAGA ACCCAATTGT CGATTACCGA GCATCGTGAG
ATCATGGATA AGCTGGAACA ACACCTGCAA CGAACAAGGA TATTTACGGA TGCGGATCTG
AATATGGATA AACTGGCGGG TTCCGTTGGC ATCAGCAAAT ACCATCTTTC CGAAGCGTTG
AATTCCTATG CTTCCAAAAG CTTTTATCAG TTTATTAATG AAATGCGCAT CGAACGAGCT
ATCCAACAGA TGCAGTTTAT GAGTAGCAGA GCGCTTCCTG TAAATGTACT GACCCTCGCT
TTTGATTGCG GCTTCAAGGC CAAGTCTTCG TTTAATCAGT ATTTTAAGAA AATAACGGGG
CTGACGCCCA CGGCATACCT CCGTTCCGTC GCCGAGATGC GGACTGAAAC ATTGTAA
 
Protein sequence
MFGTGFHQDN IVITLFLYFS NTSTHMIYVI AIGTFQAFTA VLLLWTNRLK SKADALLILL 
LLCIAAHLAT KFYIYTVVSD AHIRLQMNTF IGFCYGPLLY LYTLKNKDES FIPASRWYVF
IPFILGAIGY LTVVCVLEFS LQAGYAALLV YNQISTWTML AAGAFFPMLT LRVARKNLRN
KPQELQLIEW ISYCLLAITV VSLIFQGINA LHLLGYQDQI FCRDIIYSIL LVVCFIIIRY
KYVAVVPPAM YVETVVIPAI QEEIPAEKAN VIDIPEAVME IEPLPAHVAI VQDSAIDDED
ISAQSSPVRR TQLSITEHRE IMDKLEQHLQ RTRIFTDADL NMDKLAGSVG ISKYHLSEAL
NSYASKSFYQ FINEMRIERA IQQMQFMSSR ALPVNVLTLA FDCGFKAKSS FNQYFKKITG
LTPTAYLRSV AEMRTETL