Gene Cpin_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3021 
Symbol 
ID8359185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp3741232 
End bp3742350 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content43% 
IMG OID644965201 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003122698 
Protein GI256422045 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTTG AGACAATATG CGCAAAACCT AATTTTGTTG TTATGAAAGA AGGGGTTCTT 
GTAGTGAAAA GAAAGACAGA GAAAAAGAAG GTTGTTATTG TGGCTATGAC TGGCCGTATG
CTAATGGACT TCGTTGGCCC GGCAGACGTA TTTACAACTG CCAATATATT TTTAAAACAC
TCCGATATCC ATGAAGGATA TGATGTTAGG ATCGCTTCTC CTACAGCTGA AAAAAAAGTA
GTTACAGGAG CAGGTGCAGA TATTTTATGC CAGGTTAGTG CAATTGACAT CAGGTCGCGT
ATCGATACGC TGATTATATC CAATTATGAC TCCCATGAAT CTTTCACCGA ATTGTTCGAA
CCTTTCTATA GATGGCTTTC AAAAAGAACA CCCAATAATA CAAGGAGGAT CGCTTCTGTG
TGTGCGGGCG CATTTGCTTT AGCGCAGGCA GGTCTTATCA ACAATCATAA AGTAACAACA
CACTGGGGAC TTAACGAAAA ACTACAAAAA ACCTATCCAC AGCTCAGCGT AGACACGAAT
CCCTTCTTTG TCAAGGATGG ACATATCTAT ACATCAGGCG GGGTATCTTC AGGCATCGAC
CTGGCTTTGG CTATGGTCGA GGAAGATTTT GGTAAGGAAA TAGCTATCCA GGTAGCGAGG
GAACTGGTAG TCTATTTATA TAGACCAGGT TATCAAAGCC AGTTTGCTAA TCTGTTACCG
TCCACGGAAA GTACAGGTCT TAGCCAGAAA TTACGAACCT GGGTGCTGGA ACATTTGAAT
GAACAATTAG ATGTGAGAAG GATTGCAGAT CACCTGAATA TGAGCCCCCG TAATTTTACC
AGGGTATTCA ATAAGCAGAC AGGTTCTTCC CCGGCAAAAT TTGTAGAGAA AGTGCGTGTC
GAGCAGGCCA GGAGATTATT GGAGGATACT GACAACTCAC TGGAAAGTAT TGCAGAAATG
TGCGGATTCG GTGGTCTTAG TTCTCTGCGT CGAACATTTT TAAGGCTGCT AATGACCACT
CCGTCCGATT ACCGACGGGT ATTCAGGAAA GCACTGCGAG ATGCCGGCTT AGGTGAACAT
TACCCATTGA ATATTATCAA CCAGCATGAG ATGGACTGA
 
Protein sequence
MSFETICAKP NFVVMKEGVL VVKRKTEKKK VVIVAMTGRM LMDFVGPADV FTTANIFLKH 
SDIHEGYDVR IASPTAEKKV VTGAGADILC QVSAIDIRSR IDTLIISNYD SHESFTELFE
PFYRWLSKRT PNNTRRIASV CAGAFALAQA GLINNHKVTT HWGLNEKLQK TYPQLSVDTN
PFFVKDGHIY TSGGVSSGID LALAMVEEDF GKEIAIQVAR ELVVYLYRPG YQSQFANLLP
STESTGLSQK LRTWVLEHLN EQLDVRRIAD HLNMSPRNFT RVFNKQTGSS PAKFVEKVRV
EQARRLLEDT DNSLESIAEM CGFGGLSSLR RTFLRLLMTT PSDYRRVFRK ALRDAGLGEH
YPLNIINQHE MD