Gene Caul_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2183 
Symbol 
ID5899638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2373392 
End bp2374582 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content69% 
IMG OID641562674 
Producthypothetical protein 
Protein accessionYP_001683809 
Protein GI167646146 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.286863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAAGACT TCGACGTCGT GGTGCTGGGC GCCGGCGCGG CCGGGATGAT GTGCGCCATC 
GAGGCGGGCA AGCGCGGTCG CCGGGTGCTG GTGCTCGACC ACGCCACGGC TCCCGGCGAG
AAGATCCGCA TCAGCGGCGG CGGGCGCTGC AACTTCACCA ACGTCAACAC CGCCCCGGCC
AACTTCCTGT CGGCCAATCC GAAGTTCTGT GTCTCGGCCC TGCGCCGCTA CCGGCCGGCC
GACTTCATCG CCCTGGTGCG GAAGTACGGC ATCGCCTTCC ACGAAAAGGC GCTGGGTCAG
CTGTTCTGCG ACGGCTCGGC CAAGCAGATC ATCGAGATGC TGCTGACCGA GATGGACAAG
GCCGGCGTCA CGCTGCGCCT GGGAGTCGAG GTCAAGGGCG TGGCCAAGGA CGACAATGAC
CAGGGCGGCG GCTATGTCGT CAGCACCTCG CGCGGCCCGA TACGCTGCGC GTCCTTGGTG
GTCGCCACGG GCGGCAAGTC GATCCCCAAG ATGGGCGCGA CGGGCCTGGG CTATGAGATC
GCCCAGCAGT TTGGCTTGGC GATCCAGGAA ACTCGCCCGT CCCTCGTGCC CCTGACCTTC
GAGGTCAAGA CCCTGGAGCG CCTGGCCCCG CTGTCGGGCG TGGCGCTGGA TGCGGTGGTG
GCCAGCGGCA AGACCAGGTT CGCCGAGGCC ATGCTGTTCA CTCACCGGGG CCTGTCGGGG
CCATCGATCC TGCAGATCTC GTCCTTCTGG CGCGAGGGCG GCGAGATCAG CATCAACATG
GCGCCGGGCG TCGATGTGTT CGCGGTCCTG AAGGCGGCGC GCACCGCCAC GCCCAAGCGG
GCCCTGGCGA CGGTGCTGTC GGAAATGCTG CCCAAGCGCC TGGCCCAGCT GATCGCCGAG
GACGCCGGGG CCTATAGCAA CCTGGCCGAC TGCTCCGACG TGCTGCTGCG CGGCGTGGCC
GCGACGGTCA ACGCCTGGAC GTTCAAGCCG GTGGGTTCGG AAGGCTACCG GACGGCGGAA
GTCACGCTGG GCGGGGTCGA CACCGACGGC CTGGACTCGC GGACCATGGA GGCCAAGGCG
GTCCCGGGGC TGTATTTCAT CGGCGAGGTG GTGGACGTCA CCGGCTGGCT GGGCGGGTAT
AATTTCCAGT GGGCCTGGGC GTCGGGCTGG TGCGCGGGGC AGGCGGTTTA G
 
Protein sequence
MEDFDVVVLG AGAAGMMCAI EAGKRGRRVL VLDHATAPGE KIRISGGGRC NFTNVNTAPA 
NFLSANPKFC VSALRRYRPA DFIALVRKYG IAFHEKALGQ LFCDGSAKQI IEMLLTEMDK
AGVTLRLGVE VKGVAKDDND QGGGYVVSTS RGPIRCASLV VATGGKSIPK MGATGLGYEI
AQQFGLAIQE TRPSLVPLTF EVKTLERLAP LSGVALDAVV ASGKTRFAEA MLFTHRGLSG
PSILQISSFW REGGEISINM APGVDVFAVL KAARTATPKR ALATVLSEML PKRLAQLIAE
DAGAYSNLAD CSDVLLRGVA ATVNAWTFKP VGSEGYRTAE VTLGGVDTDG LDSRTMEAKA
VPGLYFIGEV VDVTGWLGGY NFQWAWASGW CAGQAV