Gene Caul_1560 details


Gene Information

Locus tag: Caul_1560
Symbol:
ID: 5899015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1649205
End bp: 1650422
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 70%
IMG OID: 641562048
Product: tail sheath protein
Protein accession: YP_001683188
Protein GI: 167645525
COG category: [R] General function prediction only
COG ID: [COG3497] Phage tail sheath protein FI
TIGRFAM ID:
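
The gene and protein lengths in the table above can be checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is complete and unspliced with a single terminal stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information table.
START_BP = 1649205
END_BP = 1650422


def gene_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues: total codons minus the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1218
print(protein_length(length_bp))  # 405
```

Both results agree with the listed Gene Length (1218 bp) and Protein Length (405 aa).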


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0705385
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTCCT ATTTGAGCCC CGGCGTCTAC GTCGAGGAAA TCGACTCTGG ATCCCACTCC 
ATCGAAGGGG TGGGTACATC GGTGGCCGGG TTTGTGGGGG CCGCGCCCGA TCCTGACGCG
CTGCTCGACG AGGCGGTGGC GATCAACAAC TGGAGCGAAT TCCGGCGCAA ATACGTGCGC
GACGGCGACA AGGGCACCGA CCTGGCCAAC GCCGTCTACG GCTTTTTCCT CAACGGCGGG
TCACGCTGCT ACGTCGTCAA TACCAAGGCC GACGGGGCGA TCGCCGGCAA GGGGCGAGGG
CTGGACGCCC TGGCGGCCAT CGACGAGATC GCCATCATCG CCGCGCCTGG ACGCACCGAC
GCGGCGTCCC ACGGGGCGCT GCTCGACTCG GCGGAATCGC TGAAGGACCG CGTGGCGATC
CTGGACGCCC CGCCCCGTGT CGACGATGTC GAGGTCCTGA CCCGCGCGGC CGACGGCAGC
GCGCCGCCGA CGCCGCCCAC AGCCGAGGGT GACGCCCCGC CGCCGCCGCG TTCGCGCGGC
CCCAAGCCGG GCCAGCGTCC CCGGGATTCC GATGGCGGCT ACGGCGCCTG CTATTTCCCT
TGGCTGAAGG GGCGCGACGC CATCGATCCC GACACCCAGG CGCAGATCCC GCCATCGGGA
CACATGGCTG GGATCTACGC CCGTACCGAC AGCGAGCGCG GCGTGCACAA GGCGCCGGCC
AACGTCACGA TCCGCGGGGC CGAGGGCCTT ACCCAGGTGT TGTCGCGGGC CGAGCAGGAC
GTGCTCAATC CGGTCGGCGT CAACTGCATC CGCTTCTTCA CCCGGGAGGG CGTGCGGGTC
TGGGGCGCCC GGACGCTCGC GCCAAGCTCC AGCAACTGGC GCTACCTGAA CGTCCGCCGG
CTGTTCAACA TGATCGAGGA GTCTATCGCC ATCAGCACGC GCTGGGTGGT GTTCGAGCCC
AATGCCGGTC CGCTGTGGAA GGACATCCAG CGCGACGTCG GGGCCTTCCT GACCCTGTTG
TGGCGCCAGG GCGCCCTGGC CGGGGCGCGG CCCGAGGACG CCTTCTTCGT CAAGTGCGAC
GCGGAGACCA ATCCGCCGGA GGTGGTCGAC GCCGGCCAGG TGGTGGTGGT GATCGGCATC
GCGCCGGTGA AGCCCGCCGA GTTCGTCATC TTCCGGATCG GCCAGAGCGC GGTCGGATCC
ACGGTCGAGG CCGCCTGA
 
Protein sequence
MPSYLSPGVY VEEIDSGSHS IEGVGTSVAG FVGAAPDPDA LLDEAVAINN WSEFRRKYVR 
DGDKGTDLAN AVYGFFLNGG SRCYVVNTKA DGAIAGKGRG LDALAAIDEI AIIAAPGRTD
AASHGALLDS AESLKDRVAI LDAPPRVDDV EVLTRAADGS APPTPPTAEG DAPPPPRSRG
PKPGQRPRDS DGGYGACYFP WLKGRDAIDP DTQAQIPPSG HMAGIYARTD SERGVHKAPA
NVTIRGAEGL TQVLSRAEQD VLNPVGVNCI RFFTREGVRV WGARTLAPSS SNWRYLNVRR
LFNMIEESIA ISTRWVVFEP NAGPLWKDIQ RDVGAFLTLL WRQGALAGAR PEDAFFVKCD
AETNPPEVVD AGQVVVVIGI APVKPAEFVI FRIGQSAVGS TVEAA
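
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons with the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. This is a minimal sketch, not the tool used to produce the record: the `translate` helper is hypothetical, it does not handle alternative start codons or ambiguous bases, and only the first and last stretches of the gene sequence are used here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence.
print(translate("ATGCCGTCCT ATTTGAGCCC CGGCGTCTAC"))  # MPSYLSPGVY
print(translate("ACGGTCGAGG CCGCCTGA"))               # TVEAA
```

The two outputs match the first ten and last five residues of the listed protein sequence, and the final TGA codon is the stop codon that is not counted in the 405 aa protein length.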