Gene Caul_2785 details


Gene Information

Locus tag: Caul_2785
Symbol:
ID: 5900240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3021684
End bp: 3023864
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 74%
IMG OID: 641563277
Product: ComEC/Rec2-related protein
Protein accession: YP_001684410
Protein GI: 167646747
COG category: [R] General function prediction only
COG ID: [COG0658] Predicted membrane metal-binding protein
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
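
The Start bp, End bp, Gene Length and Protein Length fields in this record are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, not the database's own method: it assumes the coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3021684, 3023864)
print(length, protein_length_aa(length))  # prints: 2181 726
```

Under these assumptions, 3023864 - 3021684 + 1 gives 2181 bp, and 2181 / 3 - 1 gives 726 aa, matching the values listed in the record.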


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.457375
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00294021
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCGAGGA CCGCGACGCC GGGCAAGACC GACGGCGGCT GGGCGGCGCG TCTTAGCACG 
CGGATTCGCG CCCGTCCTAG TCCGGCGCGG GTTCTGGCCG CGCTGATCGG CGAGATCGAC
GCCAACCGCG AGCGCTGGAT GCTGTGGTCG CCGGTGGCGT TTGGCCTCGG CGCGGCCGGC
TATCTGGAAC TGAGGACCGA GCCGTCATGG GCGCTGCTGG TCGGGCTAAC GGCGGGGCTG
GCGATCCTGG CGGGGCTGAT CGGACGTCGC TCAACGCGGG GCTTGTCGGT GTTTCTGGTG
CTGGCCACCC TTCTGGCGGC TGGAGCCTTG GCGGGCAAGG TGCGCTCGAA CGCCGTGGCC
GCGCCGATTT TGGTTGGTGA GCGGGCGGTG ATGACCCTCG ACGGCTTCGT GGTCGATGTG
GTCAGCCCCG GAGCGGGCGG GCCGCGCCTG CTGATCGCGC CCGTCGAGAT CAGCCGCCTC
ACGCCGGAGG CCACGCCCAA GCGGGTGCGG GTGACGGTCG AGGCCGAGGA TATCCCGGTG
CCGGGCCAGG CGATCCGCCT GCGGGCCATG CTGGGGCCGC CGCCGCCGCC GGCCGCGCCA
GGAGCCTATG ACTTCGCCCG CGACGCCTGG TTTCATGGCG TGGGCGGGGT GGGCTTCGCG
ATCGGCGAGT CGCGGCCCGT GGTATTGGAT CCGCCGCCGT GGCGGTTGCG GGCGGCCATG
GCGGTCAACG CCTTCCGCTG GCGGCTGGCC AGCCGGATCG TCGCGCGGAT GGGCGCCGAG
CGCGGCGGCG TCGCCGCGGC CATGGTCACC GGCCACGAGG CCTGGATCAC CCAGGAACAG
ACCAACGCCA TGCGCGCCTC GGGCCTGGCC CACATCCTGT CGATCTCGGG CCTGCACATG
GCGATCGTAG GGGGCTTCCT GTTCGGACTG GTGCGGCTAG GGATCGCCGC CTGGCCGTGG
GCGGCGCTGC GGGTTCCGGG CAAGAAGGTC GCGGCCTTGG CGGGTTTGGC GGCGATCGGG
ACCTATCTGG TGATCTCGGG CGCCCCGCCG CCGGCCCTGC GGGCGGCGAT CACCGCCACC
GTGGCCTTCG CGGCCATCCT GTTTGACCGC CAGGCCATCA CCCTGCACGG CCTGGCGATC
GCGGCGCTGG TCATCCTGCT CGTCCAGCCG GAGTCGGCGG GGGCTCCGGG CTTCCAGATG
TCGTTCGCCG CCACCGCCGC CCTGGTCGCC TTGGCCGAGG CCTGGCCGCG GCCGGTGCGC
GAGATCTCGG CGCCCTGGTG GATCCGGGCC ATTCAGGGCT CCATGAGCTG GCTGGCGGTC
AGCATCGGGG CCAGTTTCGT GGCCGGCATG GCGACGGGGC CGTTCGCCAT GCAGCACTTC
AACCGCGTGG CGGTCTGGGG CCTGCCGGCC AATCTGGCGG TCGCGCCGCT GTCGTCGTTC
GTGATCATGC CGTTCCTGGC GATCGGCGCG GCGCTGGAGC CCTTCGGCCT GGGCGGACCG
TTCCTGGCGA TCGCCGGCTG GGGCATCGGG GCGATGATGT GGATCGCCGA CGGCTTCGCC
AGCGCCGGCG GAGCGCAGCG GCTGGTCGCC AGCGGGCCGC CCTTCACCCT GGCCCTGGCG
TTCGTGGGAT TGATGCTGCT GTGCCTGTGG CGCGGCCGCC TGCGCTGGCT GGGCGCGCCC
CTGGCCCTGG CGGTGGCCCT GTGGCCGCGC GCGGCGCCGC CCGATGTCTG GATCGCCCCG
GACGGCAGCA CGGCGGCCGT GCGCCAAGGC CGCGAGGCGG TGCTGCTGCG CCCCGACGCC
CGGCGGTTCG GAGCCGAGTT GTGGAGTCGT CGACGCGGCC TGGCCGCCCC GAGCGACACC
AAGCCCAGCG CGCTCTACGC CTGCGACCGG CGATCCTGCG TACCGACCGC TGCCTCGCCG
GTCGCCCTGG CCCTGTCCTG GAGCAAGGCC GCGCCGGACG CCGAGGCGTT GGCGGCGATG
TGCGTCGAGG CCGAGATCGT CGTGCTGCGC GGCGCCGCGC CGGTCACGCC ACCGGCCTGC
CGCGACCGGA TCGTGCTCGA CGCCGACGAT TTCGCCGTCG GCGGGGCGGT CGAGCTCTAT
CGTCGGGGCG GCGACTGGTG GATCGTCTGG GCGCAGCCGT TGCGGGGCGT TCGGCCCTGG
ACGCGCGCGG CGGGGAGCTA G
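
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before a value such as the GC content field can be recomputed. The sketch below shows one way to do that. The helper names are hypothetical, and the record does not state how its own GC percentage was derived or rounded.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of bases that are G or C, computed on the cleaned sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full gene sequence block in place of this short excerpt.
excerpt = "GTGGCGAGGA CCGCGACGCC"
print(f"{100 * gc_fraction(excerpt):.0f}% GC over {len(clean_sequence(excerpt))} bp")
```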
 
Protein sequence
MARTATPGKT DGGWAARLST RIRARPSPAR VLAALIGEID ANRERWMLWS PVAFGLGAAG 
YLELRTEPSW ALLVGLTAGL AILAGLIGRR STRGLSVFLV LATLLAAGAL AGKVRSNAVA
APILVGERAV MTLDGFVVDV VSPGAGGPRL LIAPVEISRL TPEATPKRVR VTVEAEDIPV
PGQAIRLRAM LGPPPPPAAP GAYDFARDAW FHGVGGVGFA IGESRPVVLD PPPWRLRAAM
AVNAFRWRLA SRIVARMGAE RGGVAAAMVT GHEAWITQEQ TNAMRASGLA HILSISGLHM
AIVGGFLFGL VRLGIAAWPW AALRVPGKKV AALAGLAAIG TYLVISGAPP PALRAAITAT
VAFAAILFDR QAITLHGLAI AALVILLVQP ESAGAPGFQM SFAATAALVA LAEAWPRPVR
EISAPWWIRA IQGSMSWLAV SIGASFVAGM ATGPFAMQHF NRVAVWGLPA NLAVAPLSSF
VIMPFLAIGA ALEPFGLGGP FLAIAGWGIG AMMWIADGFA SAGGAQRLVA SGPPFTLALA
FVGLMLLCLW RGRLRWLGAP LALAVALWPR AAPPDVWIAP DGSTAAVRQG REAVLLRPDA
RRFGAELWSR RRGLAAPSDT KPSALYACDR RSCVPTAASP VALALSWSKA APDAEALAAM
CVEAEIVVLR GAAPVTPPAC RDRIVLDADD FAVGGAVELY RRGGDWWIVW AQPLRGVRPW
TRAAGS
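
The protein sequence starts with M although the gene sequence starts with GTG, which normally encodes valine. This is what translation table 11 (the bacterial, archaeal and plant plastid code) produces: it uses the standard amino-acid assignments but accepts several alternative start codons, and an initiator codon is read as methionine. The sketch below is a self-contained illustration, not the pipeline that produced this record. The start-codon set is taken from the NCBI definition of table 11 as I understand it and should be checked against that reference; `translate_cds` is a hypothetical name.

```python
BASES = "TCAG"
# Standard amino-acid assignments in NCBI order (first, second, third base cycle through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(block: str) -> str:
    """Translate a blocked CDS; the first codon is read as M if it is a start codon."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCGAGGA CCGCGACGCC GGGCAAGACC"))  # prints: MARTATPGKT
```

The output matches the first block of the protein sequence, MARTATPGKT.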