Gene Cpin_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4103 
Symbol 
ID8360276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5105060 
End bp5106388 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content46% 
IMG OID644966275 
Productprotein of unknown function DUF21 
Protein accessionYP_003123764 
Protein GI256423111 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00110623 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCTTAG ACATATTTTT CACCATCTTT CTGGTGCTGC TGAACGGTTT TTTTGTAGCA 
GCAGAATTTG CGATTGTGAA AGTCCGATCG TCGCAGATCG AAGTAAGTGC GGGCCGCAGC
AAGACGGTTT CGCAGGTAGC CAAAAACATT GTCAATAATC TGGACGGTTA CCTTGCTGCA
ACACAGCTGG GTATCACGTT AGCTTCTCTC GGATTGGGCT GGGTTGGGGA AAAGGTAATG
ACTGAATTGA TCCTCAATAT ATTCCATGCT CTCAATTTCA ACATGCAGGA AGCTGTTGCG
CATAAGATTG CTATTCCTAT AGCGTTCCTG GGAATTACCA TTCTGCATAT CGTATTCGGT
GAACTGGCGC CAAAATCACT GGCCATCCGT AAACCTGTTC CTACGACATT TACAGTGGCG
CTGCCCCTGA AATTGTTTTA TGTAGTATTC AGACCGTTTA TCTGGATGCT GAACAGTTTT
GCCAACGTGA TCCTGCGTAT GGTAGGTATT CGTCCGGTAC ACGAGCACGA AGACATTCAC
ACAGAAGAAG AATTACGTGT AATCATAGCA GAAAGCCATC AGGGTGGTGT TATTGAGGAA
ACAGAAAAGG CGCTTATCCA GAACGTTTTC AATCTGGGAG ATCGTCATGT ATCTGCGTTG
ATGACCCCTC GTAATGAGGT GGTATGGCTG GACGTAGATG ATGATCCGGA AGTGAATAAG
GCGAAGATCC TGACGCAGAA ACATACTGTA TATCCGATCG CTAAAGGTGA TCTGGACCAT
ACGACCGGCT TTGTATATTC CAAAGACCTG TTGAGCGATA ACTTCAACGG CGCTGTCAAT
AACCTGGAAG CGATCAGCCG TAAACTGCTG GTGGTAACAG TACACAACCG TACCTATCAG
TTGCTGGAGC TCTTCAAACG TGAGAGGATC TATCAGGCAA TGGTGGTGGA CGAATTTGGT
TCCATTAAAG GTCTGGTGAC GATCAACGAT ATCGTGGATG CACTGGTAGG TAATATCTCT
GAAACGAATG AATTTGAATA TGAGGTAATT CGCAATGAAG ATGGTAGTAT CCTGGTGGAT
GGTCAGCTGC CGTTTGTTGA ATTCCTTGAA ATGATGGGTA TTGATGCAGA TCCGCAGAAG
GTAAACGTGA CGAATTTCGT GACCCTGGGT GGTTTCATCC TGGACAGAAT GGGTAAGATC
CCTGAGGCCG GCGATAGCAT CAACTGGCGT AACCTGAAGC TGGAAGTGAT CAAAATGGAT
CAGCACCGTA TCGCCAAGGT ACACATCTGT AATTTCGATA AAGACAAAGA GAAGGATGAC
AATAAATAA
 
Protein sequence
MTLDIFFTIF LVLLNGFFVA AEFAIVKVRS SQIEVSAGRS KTVSQVAKNI VNNLDGYLAA 
TQLGITLASL GLGWVGEKVM TELILNIFHA LNFNMQEAVA HKIAIPIAFL GITILHIVFG
ELAPKSLAIR KPVPTTFTVA LPLKLFYVVF RPFIWMLNSF ANVILRMVGI RPVHEHEDIH
TEEELRVIIA ESHQGGVIEE TEKALIQNVF NLGDRHVSAL MTPRNEVVWL DVDDDPEVNK
AKILTQKHTV YPIAKGDLDH TTGFVYSKDL LSDNFNGAVN NLEAISRKLL VVTVHNRTYQ
LLELFKRERI YQAMVVDEFG SIKGLVTIND IVDALVGNIS ETNEFEYEVI RNEDGSILVD
GQLPFVEFLE MMGIDADPQK VNVTNFVTLG GFILDRMGKI PEAGDSINWR NLKLEVIKMD
QHRIAKVHIC NFDKDKEKDD NK