Gene Cpin_4811 details

Gene Information

Locus tag: Cpin_4811
Symbol:
ID: 8360987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 5999523
End bp: 6002438
Gene Length: 2916 bp
Protein Length: 971 aa
Translation table: 11
GC content: 48%
IMG OID: 644966961
Product: coagulation factor 5/8 type domain protein
Protein accession: YP_003124446
Protein GI: 256423793
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
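
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 5999523
end_bp = 6002438

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "2916 971"
```

Both values agree with the "Gene Length" and "Protein Length" fields, which suggests the record uses inclusive coordinates.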


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000406891
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000000101535
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTTGAGAA ACAAACGATG GGCGCTTTCA GCGGTCTTAG GGATATTATG CCTGGCAGCA 
AACGGACAGG AAAAGATCAT GACCTTAAAC AGCAGCAATG CGGCCGTAAG CTGGAAAGTG
AAAGCGGCAG CGGAGCTGGG ACAGACAACG GATATTCATG CAACAGCTTA CAACGACCAG
CAATGGGTGA AAGGGATTGT ACCCGGGACG GTGTTTGGCT CATTTGTGGC CGCCGGATTA
GAGAAAGATC CTAACTATGC GGATAACATT TACCAGGTAG ATAAAGCGAA GTATGACCGG
GATTTCTGGT ATCGCAGCAC GTTTAAGTTT TCCCGCCGGA AGGCAGGAGA GCAGCAATGG
CTCAACTTTG AAGGCGTGAA CCGGAAAGCG GAAGTATTCC TGAATGGCCA TCGGCTTGGT
TTACTGGACG GCTTTATGGA CCGGGGAAAG TTTGACGTGA CGAACCTGTT ACGGTATGAT
CAGCCGAATG TACTGGCCTT GCTGGTGAGC TGGCCAGGTA CGCCTATCGT GAACTATTCA
AGTCCGACGT ATATTTCCAG CGCCAGCTGG GACTGGATGC CCTATGTACC CGGACTGAAC
ATGGGTATTA CGGATGATGT ATACATTACC GGTTCCGGCG CAATCACTAT CCAGGACCCA
TGGGTACGTA CCAGTGCCGC GGATACCTCA CTCGCTAAAC TGAGTATTTC GATGGAGTTG
GATAATCATT CTGCACAGGC ACAGGAGGGT ACCATTTCCG GCACGATTCA GCCGGGTAAT
ATCCGATTCT CAAAGAATGT AAAACTATCA GCCGGACAAA CGGAACAAGT CTCTTTTCAG
CCGGAAATAG CCCATCCTGC GCTTTGGTGG CCGAACGGGT ATGGCAGTCA GCCCTTATAT
ACCTGTGACC TGCAATTCAC CGTAAAAGAC AGCGTGTCTG ACAGCCATAA CGTGACTTTC
GGTGTGAGAC GTTTCAGTTA TGACACCACA GGTGGTGTCT TGCATATCTA TATCAACGGG
CAAAAGATCT TCATCAAGGG CGGTAACTGG GGGATGTCGG AGTACCTGTT ACGTTGTCGT
GGCAGTGAAT ATGATACGAA GCTAAAGCTG CATCGTGAGA TGAATTTTAA TATGGTGCGC
AACTGGATTG GAAGTACGAC AGATGAAGAA TTCTATACTG CCTGTGATAG ATATGGCTTG
CTGGTATGGG ATGATTTCTG GTTGAACTCA CATCCCAATC TGCCTAAAGA TATCTTTGCT
TTCAACAGGA ATGCGGTGGA GAAGATCAAG CGGCTTCGTA ATCATGCCAG TATCGCGGTA
TGGTGTGGCG ATAATGAAGG TTATCCTTTA CCACCTTTGA ATAATTGGTT GAAAGAGGAC
GTCAGCACAT TTGATGGGAA TGACCGTTTG TATCAGGCGA ATTCTCATGC TGATGGTCTG
ACGGGTAGTG GTCCGTGGAC GAACTTTGCA CCCGCCTGGT ATTTTACCAG ATTTCCCGGT
GGATTTGGCG GTACGCCCGG ATGGGGGCTA CGTACTGAGA TCGGTACGGC GGTGTTTCCT
TCTTTTGAAA GCTTTAAACA GTTTATGCCG GACAGCAGCT GGTGGCCACG TAATAAAATG
TGGGACCTGC ATTTCTTTGG TCCGTCGGCG GCTAATGCTG GTCCGGACAG ATATGACGAA
GCGATCAACA AAGGTTACGG TACTGCCAGC GGTATTGAAG ACTATTGCCG GAAGGCGCAG
CTGGTGAATA TTGAAGTCAA CAAGGCGATG TATGAGGGTT GGTTACATAA TATGTGGAAG
GATGCATCGG GTATTATGAC CTGGATGAGT CAATCTGCTT ATCCGAGTAT GGTATGGCAG
ACTTACGACT ATTACTATGA CCTGACAGGC GCTTACTGGG GCGTGAAAAA AGCCTGCGAG
CCTTTGCACA TTCAATGGAG TGCGGCGGAT AATTCCGTGA AGGTGGTGAA TACTACTTTA
CAGGATTATA GTAACCTGAA GGCGGAAGCG ATTGTATACA ATATGGATGG AACGATTGCT
AAACAGATTG GTCAGACAGC GACTGTCCGT GCCGCTGCGA ATAATACAAC GCCTTGTTTT
GATCTGAATT TTAATGCAGA CAATCTGGCA TTCAGGAAGA CAGTGGTAGC TTCCTCTTCT
TCTCCGGAGA GTGCAGGCAC TGCTGCCGCA GCAGATGGCA GTGTAGGTTC CCGCTGGAGC
AGTAATTACA ACGACAATGA ATGGATCTAT GTGGATCTGG GTGTTGCGCA GGAGATCAGT
AATGTCGTAT TGATCTGGGA AGACGCACAT GCGGCAGCTT ATAATTTACA GGTCTCTGAT
GATGCGCAGT CCTGGACAGA TGTCTATAAG ACAGAGACCA GTAAAGGCGG TACTGAAACC
ATTGCTTTAC AGGCGGTGAA GGCGCGTTAT GTAAGAATGC TCGGACGTAA GCGGGCTTCA
CAATGGGGGT ATTCTTTGTA TGAGTTGGAG GTATATGGGA AACGTAGTGC GACCCTTTCA
GAAGTGCAAT TTATACGCTT GCGCCTGAGT GATGCCAAAG GTAGTCTGCA ATCTGATAAT
TTCTATTGGA GAGGCAACCG GAATGGTGAT TATACGGCCT TGAATCAATT ACCGGCAGTA
CAGCTGAAGG TGGGTTCGAA GGCGGTGCAG GTGAGTGATA GTACACGTAT TACAGCGACG
GTGAGCAATC CTTCCAATGC TGCCGGACCT GCATTTGCGG TATGTGTGCA GGTAGTGAGA
GCGGATAACA ATGAGCGGGT ATTACCGCTT GTGATGAGTG ACAACTATTT CACTTTGCTG
AAAGGTGAAA GCAAACAGCT GGAGATCTCT TTTGAGAAGC GATTGCTGGA GAGTGGTAAG
TACAAATTGA TCGTTACACC TTACAATCAT AAATAG
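
The gene sequence above is displayed in space-separated blocks of ten bases. As a rough sketch of how such a listing could be turned back into a contiguous string and how a GC percentage like the one in the record could be computed, the helpers below strip the display whitespace and count G and C bases. The function names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed row is used as sample input.

```python
def clean_sequence(text):
    """Remove display whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed row of the gene sequence, copied from the listing above.
first_row = "ATGTTGAGAA ACAAACGATG GGCGCTTTCA GCGGTCTTAG GGATATTATG CCTGGCAGCA"
seq = clean_sequence(first_row)

print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing through `clean_sequence` should yield a string whose length matches the "Gene Length" field, which is a simple way to check that no blocks were lost in copying.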
 
Protein sequence
MLRNKRWALS AVLGILCLAA NGQEKIMTLN SSNAAVSWKV KAAAELGQTT DIHATAYNDQ 
QWVKGIVPGT VFGSFVAAGL EKDPNYADNI YQVDKAKYDR DFWYRSTFKF SRRKAGEQQW
LNFEGVNRKA EVFLNGHRLG LLDGFMDRGK FDVTNLLRYD QPNVLALLVS WPGTPIVNYS
SPTYISSASW DWMPYVPGLN MGITDDVYIT GSGAITIQDP WVRTSAADTS LAKLSISMEL
DNHSAQAQEG TISGTIQPGN IRFSKNVKLS AGQTEQVSFQ PEIAHPALWW PNGYGSQPLY
TCDLQFTVKD SVSDSHNVTF GVRRFSYDTT GGVLHIYING QKIFIKGGNW GMSEYLLRCR
GSEYDTKLKL HREMNFNMVR NWIGSTTDEE FYTACDRYGL LVWDDFWLNS HPNLPKDIFA
FNRNAVEKIK RLRNHASIAV WCGDNEGYPL PPLNNWLKED VSTFDGNDRL YQANSHADGL
TGSGPWTNFA PAWYFTRFPG GFGGTPGWGL RTEIGTAVFP SFESFKQFMP DSSWWPRNKM
WDLHFFGPSA ANAGPDRYDE AINKGYGTAS GIEDYCRKAQ LVNIEVNKAM YEGWLHNMWK
DASGIMTWMS QSAYPSMVWQ TYDYYYDLTG AYWGVKKACE PLHIQWSAAD NSVKVVNTTL
QDYSNLKAEA IVYNMDGTIA KQIGQTATVR AAANNTTPCF DLNFNADNLA FRKTVVASSS
SPESAGTAAA ADGSVGSRWS SNYNDNEWIY VDLGVAQEIS NVVLIWEDAH AAAYNLQVSD
DAQSWTDVYK TETSKGGTET IALQAVKARY VRMLGRKRAS QWGYSLYELE VYGKRSATLS
EVQFIRLRLS DAKGSLQSDN FYWRGNRNGD YTALNQLPAV QLKVGSKAVQ VSDSTRITAT
VSNPSNAAGP AFAVCVQVVR ADNNERVLPL VMSDNYFTLL KGESKQLEIS FEKRLLESGK
YKLIVTPYNH K