Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_4103 |
Symbol | |
ID | 8360276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 5105060 |
End bp | 5106388 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644966275 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003123764 |
Protein GI | 256423111 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00110623 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCTTAG ACATATTTTT CACCATCTTT CTGGTGCTGC TGAACGGTTT TTTTGTAGCA GCAGAATTTG CGATTGTGAA AGTCCGATCG TCGCAGATCG AAGTAAGTGC GGGCCGCAGC AAGACGGTTT CGCAGGTAGC CAAAAACATT GTCAATAATC TGGACGGTTA CCTTGCTGCA ACACAGCTGG GTATCACGTT AGCTTCTCTC GGATTGGGCT GGGTTGGGGA AAAGGTAATG ACTGAATTGA TCCTCAATAT ATTCCATGCT CTCAATTTCA ACATGCAGGA AGCTGTTGCG CATAAGATTG CTATTCCTAT AGCGTTCCTG GGAATTACCA TTCTGCATAT CGTATTCGGT GAACTGGCGC CAAAATCACT GGCCATCCGT AAACCTGTTC CTACGACATT TACAGTGGCG CTGCCCCTGA AATTGTTTTA TGTAGTATTC AGACCGTTTA TCTGGATGCT GAACAGTTTT GCCAACGTGA TCCTGCGTAT GGTAGGTATT CGTCCGGTAC ACGAGCACGA AGACATTCAC ACAGAAGAAG AATTACGTGT AATCATAGCA GAAAGCCATC AGGGTGGTGT TATTGAGGAA ACAGAAAAGG CGCTTATCCA GAACGTTTTC AATCTGGGAG ATCGTCATGT ATCTGCGTTG ATGACCCCTC GTAATGAGGT GGTATGGCTG GACGTAGATG ATGATCCGGA AGTGAATAAG GCGAAGATCC TGACGCAGAA ACATACTGTA TATCCGATCG CTAAAGGTGA TCTGGACCAT ACGACCGGCT TTGTATATTC CAAAGACCTG TTGAGCGATA ACTTCAACGG CGCTGTCAAT AACCTGGAAG CGATCAGCCG TAAACTGCTG GTGGTAACAG TACACAACCG TACCTATCAG TTGCTGGAGC TCTTCAAACG TGAGAGGATC TATCAGGCAA TGGTGGTGGA CGAATTTGGT TCCATTAAAG GTCTGGTGAC GATCAACGAT ATCGTGGATG CACTGGTAGG TAATATCTCT GAAACGAATG AATTTGAATA TGAGGTAATT CGCAATGAAG ATGGTAGTAT CCTGGTGGAT GGTCAGCTGC CGTTTGTTGA ATTCCTTGAA ATGATGGGTA TTGATGCAGA TCCGCAGAAG GTAAACGTGA CGAATTTCGT GACCCTGGGT GGTTTCATCC TGGACAGAAT GGGTAAGATC CCTGAGGCCG GCGATAGCAT CAACTGGCGT AACCTGAAGC TGGAAGTGAT CAAAATGGAT CAGCACCGTA TCGCCAAGGT ACACATCTGT AATTTCGATA AAGACAAAGA GAAGGATGAC AATAAATAA
|
Protein sequence | MTLDIFFTIF LVLLNGFFVA AEFAIVKVRS SQIEVSAGRS KTVSQVAKNI VNNLDGYLAA TQLGITLASL GLGWVGEKVM TELILNIFHA LNFNMQEAVA HKIAIPIAFL GITILHIVFG ELAPKSLAIR KPVPTTFTVA LPLKLFYVVF RPFIWMLNSF ANVILRMVGI RPVHEHEDIH TEEELRVIIA ESHQGGVIEE TEKALIQNVF NLGDRHVSAL MTPRNEVVWL DVDDDPEVNK AKILTQKHTV YPIAKGDLDH TTGFVYSKDL LSDNFNGAVN NLEAISRKLL VVTVHNRTYQ LLELFKRERI YQAMVVDEFG SIKGLVTIND IVDALVGNIS ETNEFEYEVI RNEDGSILVD GQLPFVEFLE MMGIDADPQK VNVTNFVTLG GFILDRMGKI PEAGDSINWR NLKLEVIKMD QHRIAKVHIC NFDKDKEKDD NK
|
| |