Gene Caul_1824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1824 
Symbol 
ID5899279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1933307 
End bp1935379 
Gene Length2073 bp 
Protein Length690 aa 
Translation table11 
GC content68% 
IMG OID641562314 
Producttail sheath protein 
Protein accessionYP_001683451 
Protein GI167645788 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.414219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0927091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTGC AAACAACCTA TCCAGGCGTC TACATTCAGG AACTACCCAG CGGCACCCAC 
ACCATCGTCG GCGTAGCGAC GTCGATTACC GGTTTCATCG GGTATCTAAG GCGCGGGCCG
ATCAACACCG CCGTGCCGTT GCTCAATTTC GGCGATTTTC AGCGGGTCTA CGGCGGCATC
GACAGCCTGA GCGACACCAG CTACCAGATC TCGCAATTCT TCCTGAACGG CGGGACCGAG
TGCTGGGTCA CCCGCTGCGT CGACGCCACC CAGCCGCCCG TCCTGCCCGT TGCTCCGGTC
CCGGGCGGAC CGATCGGGGG GACCGGCAAG CCCGGCACGA CGCCCACGCC GCCAACGGGC
GTGGCGACGC TGAGCGCCTC GGCCGCCAAT CCTGGGACCT GGGGTGAAAA CCTCTACCTG
ACCATCGACT ACATGACGCC GTCGGGGTCC AACGCCTTCA ACCTGTACGC AACGCTCTAC
AACTTCGCGA GCGGCGGCGC GTCGGTGGTG CGGACCACCG GCTTGACCGG CGTCACCATG
GACATGACCC AGCCGAACTA TATCGGCGAA GCCCTGCTGA GCCTGAGCAG CACCTACACC
AACCTGATCA ACATCGCGGA CCTTGTGAAC CCGGCGACGG CGGGTCCGGG CTATGTCCCG
CCCGCCGCGC CGCTGCCCAG CGGCACGATC CTGCAGTTCG TCCCGCCCGC CCAGGCGGCC
GCCGGCGTCA CCCTGACCGT CTCGGTGGTC CTGCCGACGC CGCCGGGGCA GAAGACCCCG
CCGACGCCGC CCGCCCCCGT GACCGTGGCG GTCGGTTCGA TCCTGACCTT CCAGGACTTC
GTGGCCGCGG TGCAGAGCGC CCTGGCGGTG GCCGGCGCCC AGATGAACAT CCCCGCCCTG
GCCAGCGCCG CGGTCCGCAC CTTCGCCCTG CCTTTCGTGA CCACTCCGCC GGCCAGCGCC
ACGCCGAACA TGGTCCAGAT CCTGCTGACC GATCCCCTGC AGGCGGGCGT TCTGGTGTCG
GTCGTGGGCT CGTCCAAGTC GCTGTTCAAC CAGCTGCAGA CTAACATCCA CGCGATCAAC
CTGGCCGCCC CGCCCCCGCC CTCGGCCAGC CCTCCGCCGG CCGGCGGTTC CGCCCCGCCG
CCCCCGCCGC CCGACGGCGC GCTTCCCAAG GGCATCGATA TCGCGGGCAA CTCGACCAAC
CGCACCGGCG TCTACGCCTT CGACGGCGTC AGCATCATCA ACCTGCTGAG CGCGCCCGAC
CTGCGCTACA TGACGACGTC GGACTATCTG ACCAGCGCCA CCTCGATCCT CAACTACGTT
CTGCAGCGGC GAGCCTTCGC GATCCTCGAC CTGCCCAGCA CCGTCAACTC GGTGCCGCTG
GCCTCGGCCT GGGTCTCGAC CATCCCGCCC AGCTTCGGCC CGGGCATCAT CAGCGCCGCG
GCCTATTATC CCGAACCCGA GGTGCCCGAC CCGTTCAGCT CGCAGCCGCG CTCGATCGGG
GCAAGCGGCA CCATGGCTGG CCTCTACGCC CAGACCGACC TGACGCGGGG GGTGTGGAAG
GCGCCTGCCG GGATCACCGC GGCCCTGACC GGCGTGCAGG AACTGGCCTA TGTGATGACC
GACCAGGAGA ACGGCATCCT CAACCCCCAG GGGATGAACG CCCTGCGCAC CTTCCCGGTC
TATGGCAGCA TCGCCTGGGG CGCGCGGACC CTGGCCTCGG CCAACCCCGC CGACGACGAC
TGGAAGTACA TCAACGTCCG CCGCCTGGCG CTCTACATGG AGCAGAGCCT GGTCGCCGGC
CTGCAGTGGG TGGTGTTCGA GCCCAACGAC GCGACGCTCT GGGCGCAGAT CCGCCTGACG
GTGACCAGCT TCCTGCACCC GCTGTTCCAG CAGGGCGCCT TCGTCGGATC CACCCCGGAC
CAGGCCTACC AGGTGATCTG CGACGCCTCG ACCACCACGC CGGAGGACAT GGACAACGGG
ATCGTCAACA TCCTGATCCT GTTCGCCCCG GTGAAGCCGG CTGAATTCGT GGTGATCAGC
ATCCAGCAGA TGGCTGGGCA ATCCTCGAGC TGA
 
Protein sequence
MPVQTTYPGV YIQELPSGTH TIVGVATSIT GFIGYLRRGP INTAVPLLNF GDFQRVYGGI 
DSLSDTSYQI SQFFLNGGTE CWVTRCVDAT QPPVLPVAPV PGGPIGGTGK PGTTPTPPTG
VATLSASAAN PGTWGENLYL TIDYMTPSGS NAFNLYATLY NFASGGASVV RTTGLTGVTM
DMTQPNYIGE ALLSLSSTYT NLINIADLVN PATAGPGYVP PAAPLPSGTI LQFVPPAQAA
AGVTLTVSVV LPTPPGQKTP PTPPAPVTVA VGSILTFQDF VAAVQSALAV AGAQMNIPAL
ASAAVRTFAL PFVTTPPASA TPNMVQILLT DPLQAGVLVS VVGSSKSLFN QLQTNIHAIN
LAAPPPPSAS PPPAGGSAPP PPPPDGALPK GIDIAGNSTN RTGVYAFDGV SIINLLSAPD
LRYMTTSDYL TSATSILNYV LQRRAFAILD LPSTVNSVPL ASAWVSTIPP SFGPGIISAA
AYYPEPEVPD PFSSQPRSIG ASGTMAGLYA QTDLTRGVWK APAGITAALT GVQELAYVMT
DQENGILNPQ GMNALRTFPV YGSIAWGART LASANPADDD WKYINVRRLA LYMEQSLVAG
LQWVVFEPND ATLWAQIRLT VTSFLHPLFQ QGAFVGSTPD QAYQVICDAS TTTPEDMDNG
IVNILILFAP VKPAEFVVIS IQQMAGQSSS