Gene Caul_3471 details

Gene Information

Locus tag: Caul_3471
Symbol:
ID: 5900926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3750517
End bp: 3753201
Gene Length: 2685 bp
Protein Length: 894 aa
Translation table: 11
GC content: 63%
IMG OID: 641563977
Product: hypothetical protein
Protein accession: YP_001685096
Protein GI: 167647433
COG category:
COG ID:
TIGRFAM ID:
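
The coordinate and length fields above agree with one another, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The record states neither convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is not counted."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3750517, 3753201)
print(length, protein_length_aa(length))  # 2685 894
```

Under these assumptions the computed values match the listed Gene Length (2685 bp) and Protein Length (894 aa).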


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAACA CAGTAGTCCT CATCACGACG ACAGGATCGG GCACTTGGAC GCCTCCCGCA 
GGCGTGACAT CTGCCATGAT TGAATGCTGG GGTGGTGGTG CGGGTTCGTC GCAAACCGGA
TCGAACAGCT TCATCGGTGG TGGTGGTGGT GCGTATTCGG CTTCCACGAT TTCAGTCACG
CCCGGTACGC CGGTCTCATA TCTCGTCGGC GCAGGTAGCG CGACAGTCAC CGGGTACTCG
TCTGCACCGC TTTACGCGAG CGCTGGTGGT GACACATGGT TCAGCAGTTC ATCGACTCTC
CTAGCCAAGG GCGCGCAAGC GGCCTCAGGC TCGACCTCAT TAGGTGGCGC AGCAGCTTCA
GGCGTCGGCA CAACCAAATA CTCAGGCGGT GACGGAGCCT ATCAATCTAG CGGCGCTGGT
GGTGCTGGTG GTGGTGCTTC GGCGTCTCCA ACTGGCAACG GTGGCAACGC AAGCGCCTCG
ACCGGCGGCT CTTCACCCGG TGCTGGTGCT GGTGGCGCTC CGTCATCGAC CTTCCCCTTC
ACCGGTAATC CCGGCGCGTC GAACGTGGAC GGTGGTGGTG GTGATGGTGG ACACCTCAAT
TGGGTTTCGA CCGGTGGTTC TGCTGGTGCT CCCGGTGGTG GTGGTGCTGG TGCGGTTACT
GGTGTTCCTG GTGCTCGCGG CCAAATCCGC ATCACCTACG CCGTGGCATC TTCAGGTGCG
ACGGGTACGG GTTCAGCAAC TCTTCCCGGT CCCACCGCAT CCGCGACCGG ACGACTCGCC
CTCACTGGTA GCGCAGCGAG TACCCTAGCC GCTCCAACTG CATCGGCCTC AGGGACAAGC
ACGACATCAG GTGGTGGTGG TGGTGGTGGT GGTGGCACGA CCATCATGGC GCTTACGACG
GTAGGCACGG GCTCTTGGAC CGTTCCATCG GGCGTCACGT CTATCATTGC CGAATTGATC
GGTGGTGGTG GTGGTAACTC AACGAACCTG GGCGCTGGTG GCGGTGCTTA TTCGAAGAAG
ACTCTATCGG TCACAGCGGG TTCGAGCATC AGTTTCAACG TCGGTGTGGG CGGTGCTCCT
GAAACACCGG TCAACAATTA CGACGCCACG GATGGCGGCG ACACATGGTT CGTGAACAAC
ACGACGGCTC TCGCTAAGGG CGGCTGGGGT GGTTCATCCA CGCACGGCTG GTCACAGGGG
CTAGGTGGTT CAGCAGCCAA CAGCATCGGT GACATCACAT ATAGCGGCGG CAACGCGGCA
GTCGGTGGTG GTGCTGCGGC TGGTCCGCTT GGAGCGGGTG CAGCAGGAGA CGGCGGGACA
AACAAGGGTG GTATTGCCGT TGGCAGTTGG ACCCTATCAG CCGGTGGAAC AGTCGCGCCC
GGCAGTGGTG GTGACAGCGG CGGGCCAGCG GTCAACGTGG GTAAGGCCTA TGGTGGTGGC
GCCGGTGCGA TCAACTATGG TGACGTGAGC ACTGGCGGCG GCCAGGGTCT TATCGTGATC
ACCTACACGG TCGCAGGAGC AGGCCCCTCG CCAATCACCG GCACGGGTGC AGCAACCATC
CCAGCGCTCT CGGCAGCATC CACGGGCCGA CTTGCCCTCA CGGGTAGCGC AGCAAGCACT
TTGGCCGTTC TCACGGCTTC GGGCGTGGGC TCAAGCTCCA CCACGGGTAC GGGCTCTGCA
ACGATCCCAG GTCCATCGGC TGCATCCACC GGGCAACTCG GCCTGTCAGG CACCGCTACG
AGCGTCCTAG GGGCTCCTAC GGCGTCTGCA GTCGGCACGC ACTCAATGCG CGGCACGGCG
ACCTTTGCGC TCATCACGCC CATAAGCGCC GGTACGGGCA AGCTGTCGCT AGCTGGCACG
GGTTCAGCAG TTCTATTGGC CCTGTCGGGT TCCGGCGTTG GACAACTCGG CCAGCCCGCA
ACAGGCAGCG GCACCCTGGA TGCTCCAACG GCATCGGGTA TCGGCTCAAG CTCCCTCACG
GGTTATGGCG ATGGCCAGTT GTCGGCGCTT ACCGGAAGCG CGTTGGGTGG TGGTTCAATT
CCACCGCAAA CAGGAACGGG TTCAGCGACT CTCTTGGCGC TATCGGGTTC GGGTCTATCT
CGTCTCGGTC TCACGGGCAT TGGAGCAGGA ACCCTACAGA AGCCCCTAGG CGCTGGTCAG
GGCAGCTTGG GTCAACGCGG TACGGCAACG GGCTCTTTGA CCCTGCTAGG GGCTTCGTCG
GGTACTCTAA AGCAATCCGG CGTAGGTACG GCCATTCTCA CCGCTCCCCT GGCGTCTGGT
GCGGGCCAAC AAACCGTCAG CGGTGTCGGT GCATCGACCC TGCCTGGTCC TACGGGATCT
AGCGCAGGCG ACATCATCCT AGCCGGTCTC GGTGAAGGCC TCCTCGATGC GCCCCAAGCA
GATGGACAGG GCGAGCTTCG ACTTGTCGGT CAGGGGGCAG GCTTGCTCGA TGGACTGTCG
GCGTATGGCT ACAGCGATGT GACGTTCAAG CCCTCGCCAC TTCGCACCTT CGTTCCAACG
AGCGAGGTCC GATCCATCGC AGCAGAGGCA GAAGGACGCT CGATCATTCC CGACATGGAA
AGCCGCGTCA TGACGATGGG TGAAGAGGTC CGCCGCGTAA CAGCACAGGC CGAAGGGCGA
ACGGTAAATA CGACATCGGA GAGCCGCCTA GCGGTCTCAA AATAA
 
Protein sequence
MANTVVLITT TGSGTWTPPA GVTSAMIECW GGGAGSSQTG SNSFIGGGGG AYSASTISVT 
PGTPVSYLVG AGSATVTGYS SAPLYASAGG DTWFSSSSTL LAKGAQAASG STSLGGAAAS
GVGTTKYSGG DGAYQSSGAG GAGGGASASP TGNGGNASAS TGGSSPGAGA GGAPSSTFPF
TGNPGASNVD GGGGDGGHLN WVSTGGSAGA PGGGGAGAVT GVPGARGQIR ITYAVASSGA
TGTGSATLPG PTASATGRLA LTGSAASTLA APTASASGTS TTSGGGGGGG GGTTIMALTT
VGTGSWTVPS GVTSIIAELI GGGGGNSTNL GAGGGAYSKK TLSVTAGSSI SFNVGVGGAP
ETPVNNYDAT DGGDTWFVNN TTALAKGGWG GSSTHGWSQG LGGSAANSIG DITYSGGNAA
VGGGAAAGPL GAGAAGDGGT NKGGIAVGSW TLSAGGTVAP GSGGDSGGPA VNVGKAYGGG
AGAINYGDVS TGGGQGLIVI TYTVAGAGPS PITGTGAATI PALSAASTGR LALTGSAAST
LAVLTASGVG SSSTTGTGSA TIPGPSAAST GQLGLSGTAT SVLGAPTASA VGTHSMRGTA
TFALITPISA GTGKLSLAGT GSAVLLALSG SGVGQLGQPA TGSGTLDAPT ASGIGSSSLT
GYGDGQLSAL TGSALGGGSI PPQTGTGSAT LLALSGSGLS RLGLTGIGAG TLQKPLGAGQ
GSLGQRGTAT GSLTLLGASS GTLKQSGVGT AILTAPLASG AGQQTVSGVG ASTLPGPTGS
SAGDIILAGL GEGLLDAPQA DGQGELRLVG QGAGLLDGLS AYGYSDVTFK PSPLRTFVPT
SEVRSIAAEA EGRSIIPDME SRVMTMGEEV RRVTAQAEGR TVNTTSESRL AVSK