Gene Cpin_4044 details

Gene Information

Locus tag: Cpin_4044
Symbol:
ID: 8360217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 5031377
End bp: 5032678
Gene length: 1302 bp
Protein length: 433 aa
Translation table: 11
GC content: 45%
IMG OID: 644966217
Product: tyrosyl-tRNA synthetase
Protein accession: YP_003123706
Protein GI: 256423053
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0162] Tyrosyl-tRNA synthetase
TIGRFAM ID: [TIGR00234] tyrosyl-tRNA synthetase
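
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5031377, 5032678)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```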


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.225408
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0290637
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACGA TTATTGTAAA TAAGATATAC CATCATAGCT CACTGGAGCC CCGGACATTC 
AATTGTCCGG GGCTTTTTTA TTTCTATCAG ATCATGACAA CAACAACACT ACCCACGGCA
AATGCAGCAG TCATATTGCC AGAAAACGGC CTCACAGAAA AAATCAGGCA GGCCGAAAAA
GACAATCGTA ACCTGATCAT TAAACTGGGC TTTGATCCTA CCGCCCCTGA CCTACACCTC
GGACACGCCG TAGTCCTGAA ACAGCTACGC GCATTTCAGG ACCTGGGTCA CCAGGTCGTT
ATTATCGTAG GCAGCTTCAC CGCACAGATC GGAGATCCTA CCGGTAAAAA CAAAAGCCGG
AAGCCTTTAA GCAAGGAAGA AGTGCTGGCG AATGCGGATA CTTACATCCG TCAGTTAGCG
AAGATCATTG ACACCAACAA ATGTCAGATC CTGTTCAATG GAACCTGGTT GGATAAGCTG
TCCTTTCCCG AGATCCTTCA GTTACTTTCC AAAGTAACCG TCGCCCAATT GCTGCATCGC
AATGACTTCA ACAAACGATT TACGGAAAAC GTGCCCATCG CTATGCACGA ACTCATGTAC
CCTATCCTGC AAGGATTTGA TTCTGTGCAG ATTAACGCAG ACATTGAAAT GGGTGGTACA
GATCAGTTAT TCAATTGCAC CATGGGCAGA CAATTGCAGG AAGCGCACGG GCTTCCACCT
CAAGTTGTAA TGTGTATGCC CCTGCTAAGA GGTCTTGATG GAAAGGAGAA AATGAGCAAG
TCCCTGAATA ACATTATCGG TCTGACGGAT ACACCCAACG AGATGTTTGG AAAGACCATG
TCCATCCCCG ACCACCTGAT TGATGAATTT ATTGATCTGA CCACAGATTT CTCCCCCGAG
AAAAAGCATG CCCTGAAACA ATTAATGGTA TCCGGTGAAA ATCCTATGAA TATCAAGAAG
ATCATTGCTG CGAATATCAT CACACAATAC CACGATACCA ACAGCGCCGC ATCCGCAGAA
GCCTTCTTTA TCAATCAGTT CCAGAACAAG TCCTTTGAAG AAAAGACCTT CGAACAGATC
GCCATTAACT CCTTCACTGA GGGAACAGCC ACCACCACCA CATTGCTGGA CCTCTGCCAA
CAACTAAAAA CGGACGAAAC CAGAAGCGGC ATCAGAAGGT TGATTATCAA TGGTGCAATC
ACTTTGGATA ATGAAAAATT GCTCGATCCC AATCAGCAAA TACATCTACG TCCAGGTATA
AAGATTAAGA TCGGCAGGCG ACTTTTCATA GAACTTATAT GA
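
A GC content figure such as the one in the Gene Information section is conventionally the share of G and C bases in the sequence. The sketch below shows that calculation on a sequence laid out in space-separated blocks like the one above. The function name is illustrative, and the rounding convention used by the database is not known, so no claim is made here about reproducing the reported value.

```python
def gc_percent(dna):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence only, as a small illustration.
print(gc_percent("ATGCAAACGA"))  # 40.0
```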
 
Protein sequence
MQTIIVNKIY HHSSLEPRTF NCPGLFYFYQ IMTTTTLPTA NAAVILPENG LTEKIRQAEK 
DNRNLIIKLG FDPTAPDLHL GHAVVLKQLR AFQDLGHQVV IIVGSFTAQI GDPTGKNKSR
KPLSKEEVLA NADTYIRQLA KIIDTNKCQI LFNGTWLDKL SFPEILQLLS KVTVAQLLHR
NDFNKRFTEN VPIAMHELMY PILQGFDSVQ INADIEMGGT DQLFNCTMGR QLQEAHGLPP
QVVMCMPLLR GLDGKEKMSK SLNNIIGLTD TPNEMFGKTM SIPDHLIDEF IDLTTDFSPE
KKHALKQLMV SGENPMNIKK IIAANIITQY HDTNSAASAE AFFINQFQNK SFEEKTFEQI
AINSFTEGTA TTTTLLDLCQ QLKTDETRSG IRRLIINGAI TLDNEKLLDP NQQIHLRPGI
KIKIGRRLFI ELI
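
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, not the database's own method. It assumes the forward reading frame starting at the first base, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in their permitted start codons). The names used are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCAAACGA TTATTGTAAA TAAGATATAC"))  # MQTIIVNKIY
```

Passing the full gene sequence to `translate` in the same way allows the result to be compared against the protein sequence listed above.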