Gene Caul_2373 details


Gene Information

Locus tag: Caul_2373
Symbol:
ID: 5899828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2577858
End bp: 2579558
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 63%
IMG OID: 641562864
Product: hypothetical protein
Protein accession: YP_001683998
Protein GI: 167646335
COG category: [S] Function unknown
COG ID: [COG4805] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
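
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the coding sequence ends in a single stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2577858
end_bp = 2579558
gene_length_bp = 1701
protein_length_aa = 566

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the terminal stop codon
# is not translated, so the protein has one residue fewer.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)       # span agrees with the Gene Length field
print(gene_length_bp % 3 == 0)         # length is a whole number of codons
print(residues == protein_length_aa)   # agrees with the Protein Length field
```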


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACACCA ACCTCGCCTT GGCCGAACTG GCTGCGCGCT ATTGGGCGTT TCAATGCGAA 
GAGTTCCCGA TCAACGCGAT CGCGGCTGGG GCGGCGACCA CAGCCTCACA GCTGATGCGG
GAGGCGCCGG CGGATCATGA GCGGCGCGCC GCTTGGGCGC GGACCGCCCG CGACGCGCTC
TTGGCGATCG ACGTCGGATC GCTCGAGATT GACGACACCG CGACACATCA ACTGCTCGAT
CATGAGCTTC GTCTCACGAT CGAACTGGTC GAAAGCGGCG CACACTTGCG CCCCACGATC
TATCCGCTCG GCCCGGAATT CACACTGATC TACTGGGCGA ATTCGACGGC CCTGGCCACC
GCGACCGATG CGCGACTTTA TCTGGCCAGG CTCGCCGCGA TACCTGCATC GTTCGAGACC
GTTCAGGCGG GATTGGCCCA AGGGGTGGCC CAAGGGATGT CCTATCCGCG CCTCGTCGTG
GAACGTGCCG TCGCGCAGGT CCGCGGACAG ATCTCGGCGG CTCTGGAGGC GGACCCCTTC
TACAGTCCTC TGGGCCGGGC GGCAGCCCGG GGGGGCGTGA TGGAGGATTT GGCGGGCGAG
GGGCGTGCAC TGGTGGAAGA GGTCGTGCGA CCCGCCTTTC TCGCCTACGC GGACTTTCTC
GAAAGCACGG TGCTGCCGGT ATCGCGCGAG AGCATCTCGG GCGCCGACGA CGTCGATGGC
GAGCGTTTCT ACCGCTACAA TATCAACCAA TATACGACGG TGGATCTACC GCCCGAGGCC
ATCCACGCCA CCGGGCTGGC GGAGGTCCAG CGTCTCAAAG GCGAGATGCA GGCCGTCGCC
AGCGATGCGG GCTTCCCCAA TGACATCGAA GGCTTCCGTG ACCGCCTGAA GACCGACAAC
CGGCAATTCG CCGAAAGTGG GGAAGCATTG CGCGAGCAGA TCGAGATTCT GTCAAAACGC
ATCGATGCGA GGATCCCGGA ATTCTTCGGG CGAATACCCC GCATCAGCTA CGGCGTGAGC
AGCATTCCCG AAGCCATCGC CGAGAGAATG CCTCCGGCCT ACGCCCAGCC CAATCCGGCC
GACGGCAGCG CGGCGGGCGT CCACTGGATC ACGTCGATCC CCAGCAAATG TCCAAGCTAC
ATGCACTTGC CCTTGGCGCT GCACGAGGCC TGGCCCGGTC ATCTGATGCA TCTCGCCTTG
ATCCAGGAGA TGGATCAACT TCCCGACTTC CGCCGCTACG GGGCCATGAA ATACTCCGCC
TGCCTTGAAG GCTGGGCGCT TTATTGCGAG GCGTTGGGCG AAGACATGGG CTTTTACGAT
ACGCCGGAGA AGCGGTACGG ACGCCTAGAG ATGGAGATGT GGCGCGCGGT GCGGCTGGTC
GTGGACACCG GAATTCATTC TGGAGAATGG AGCCGCGATC AGGCTATTTC CTTCTTCCAG
GACAATATGG CGATGCCGCT CGAGACGATA ACGGCCGAGG TCGATCGCTA CATCGGTTTG
CCTGGGCAGG CGCTCGCCTA TCAGCTCGGC AATCTCAAGT TTCGCGAGCT TCGCGCCCGC
GCGCAGGCGG CTCTCGGCGA GGATTTTCGG ATCCGCGATT TTCACGACGC CCTGATGGCG
GCCGGCGCCG TGACGCTGCC TGTGCTTGAG ATGCTGATGG ACGACTGGAT CGCCGACGCG
AAGGTTGCCG TGGCCGCATG A
 
Protein sequence
MDTNLALAEL AARYWAFQCE EFPINAIAAG AATTASQLMR EAPADHERRA AWARTARDAL 
LAIDVGSLEI DDTATHQLLD HELRLTIELV ESGAHLRPTI YPLGPEFTLI YWANSTALAT
ATDARLYLAR LAAIPASFET VQAGLAQGVA QGMSYPRLVV ERAVAQVRGQ ISAALEADPF
YSPLGRAAAR GGVMEDLAGE GRALVEEVVR PAFLAYADFL ESTVLPVSRE SISGADDVDG
ERFYRYNINQ YTTVDLPPEA IHATGLAEVQ RLKGEMQAVA SDAGFPNDIE GFRDRLKTDN
RQFAESGEAL REQIEILSKR IDARIPEFFG RIPRISYGVS SIPEAIAERM PPAYAQPNPA
DGSAAGVHWI TSIPSKCPSY MHLPLALHEA WPGHLMHLAL IQEMDQLPDF RRYGAMKYSA
CLEGWALYCE ALGEDMGFYD TPEKRYGRLE MEMWRAVRLV VDTGIHSGEW SRDQAISFFQ
DNMAMPLETI TAEVDRYIGL PGQALAYQLG NLKFRELRAR AQAALGEDFR IRDFHDALMA
AGAVTLPVLE MLMDDWIADA KVAVAA
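
Both sequences are printed in blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `ungroup` and `gc_fraction` are hypothetical, and the input shown is a toy string, not data from this record.

```python
def ungroup(seq_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input for illustration only; paste the gene sequence lines here instead.
toy = "ATGC GGCC\n"
cleaned = ungroup(toy)
print(cleaned, gc_fraction(cleaned))
```

Multiplying the returned fraction by 100 gives a percentage that can be compared with the reported GC content.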