Gene Caul_3500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3500 
Symbol 
ID5900955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3779149 
End bp3781848 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content61% 
IMG OID641564006 
Producthypothetical protein 
Protein accessionYP_001685125 
Protein GI167647462 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.512789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATCA CCACCAACGA AATGCTTGCC GATCTTGGCG GGCGCATTGT CGATCTCATC 
CAAGAGAATC TGACGCCCCC GCAGTTCATC ACCGCGCTTA GCGGTCTGGC GACCAATTGG
GATGGTGGAA CACTGTCAGC CGTTGGTCTG GGGCGCCAAC TCTCCGACAG CATCGAAGCT
CGTAACCGCT GGGTCGATCA GGCTTCCGCA TACTTCCAGG GCACCGCGAC AGGCGGACCA
AACAGCGACG GCAAGTACCC GTTCACGACA CGCACAGGCG CGACCGTTTC GATGGAATGT
CCAGCCAAAC TAGCAGCCAT GGTCACGGGC CCATCGGAGA GCGCTCAAGC CTATGCTGCA
TCTGCTCTTG CTGCTCGCGA TGTCATTCTA GGGAAGGTCG CTGAAGCAGC GGCATCCGCG
ACCGCAGCGG CAGGCAGCGC TACGGCGTCA GCAGGTAGCG CAACAGCGGC AGCAGCTAGC
GCCACGACAG CCGGTACGGC GAAGACCGCT TCTGAGACAG CTCGCGATGT CACGCAGGGC
TATCGAGACA CCACCCTAAC GGCCAAGACC GCGACCGAGA CGGCACGCGA TCTAACCCTC
ACATACCGCG ACGGCGCACT AACGGCTAAG GACGCCGCGG TGACTGCTAA GACGGCTTCT
GAGAGCGCGC GTGACGCAGC TATCGCAGCG GCATCGTCCG TTGATACCAC GGCCATCAAC
AACAACCTGG CCCTCAAGTT CGACAAGGGC GGCGGGACGC TGACAGGCGA CCTCTACCTG
ACGGCCAGCG GTTCGCAGTC TCCCGGCTTC CACCTCAAGG CAAACGGCAT GGGAACCGAC
GCGAAGATCA GCCGCGCCTA CTCACAGGGC GGACTCTTCA CGTTGGACTT CGTGAACGAC
GCCTACACGG CGGCGCAGCC ATTCCTGACC GTTGGACGCT CGGGCACCAC GCCTCAGGCA
ATCAACCTGT TCGGTACATC GCTGAGCTTC AACAGCACGG CGATCACGAC CGCAACGACT
CTGGCGAGCG GACTAGCGAC CAAGCAAAAC ACCCTCGGCT TCACACCGGT TCAACAAGGT
ACAGGCGTCG GTCAAACCAC CAACGTAATC AAGTTGGGCT GGTCCAACGA GGGCAAGCTG
AAGGCAACGA TCGATGCGAC TGACCAGGGC GCTATCGTGT TTGAAAGCGC GCTGACTTGG
ACAAACCTAT CCGGCAAGCC GTCGAGCTTC GCGTCGGATT GGTCCACGCT CACAAGCAAG
CCATCGACCT TTGCCCCATC GGCTCACACT CACGTCATTG CCGATACGAC CGGACTTCAG
GCCGCGCTTG ATGCGAAGCT CGCGACCACA GGGTTCACCT ACACAGCGCT ACCGGGCAAG
CCGTCGCTCT ACCCAACCGA CACGGCCAAC GTCTCAGGTC TGACAGCAGC CCTCGCACTC
AAGGCAGACG CCTCGGCGTT GACTAGCGGT CTTGCCGCCA AGGCTCCAAT CGCCAGTCCG
CAATTCACGG GCACGGCTAA CATCACAGGT GCAGCAGCAA CGACACGACT TTTTGGAGCT
CAGACAGCTG GTGTCCTGCG TTGGATGTGG GGTGCGGCTG CAGACACCGA AAGCGGCTCC
AATGCGGGTT CAAATTGGGC GCTCTACAGC TACGCGGACA ATGGCGCGTT TATCGGTACT
CCGATCTCCG TGACCCGTGC GTCCGGCGCA GTTACGTTCG CGGGAGCCGC GACGTTCAAC
AGCACCGTAT CCATCGGCGG CGCTACGCCT TGGACATCGG CCAACTTCAC GCCTTCCACC
AAGCTCGATA CGTCGGCCTT CACGTGGGCA AACCTGAGCA GCAAACCAAC GACATTTGCG
CCTTCGGCTC ACACTCACGC CACGTCTGAG ATTACCGGCT TAGATACTGC CCTTGCGGGG
AAAGCTGCCC TGTCGGGCGC GACCTTCACT GGTGCAGTCG CCATGAACTC GACCCTCGCG
GTCAGTGGAA ACGTCCGTGC GATTAGCACG GGCGGCACGG GACAGTTTAC CGCTGTCAGC
GGCAATACAA TGGCCGGGTT CTACCAAGAC AGCACGAATT TCTACCTTCT GAAGAGCGCG
ACTTCTAACA CCACGTTCGA CGGGCATCGA CCTATTGTGG TCAGCCTCTC GACGGGTTCG
GTGACCATCG ACGGCACAGG CGCTGGCGGC ACTCAGGTTG GCGGTACATT GGGCGTCACC
GGCACGCTCA ATTGCGCCGG TGAAATCTAC ACACCCGGCT GGATCAGGCT AACGGGCAAC
CAGGGCATGT ACTGGAATGC CTGGGGCGGC GGTTGGACGA TGACCGACAG CACGTGGATG
CGGTCATATG GCGACAAGTC CATCCTGACG GGCGGCAACA TCCAGTGCGC GATGTGGACC
GTGACTTCCG ACGAACGCCT GAAGACCGAC ATCAAGCCGC TGACCAACGG CAGCGAGATC
ATCTACGGAA CGAACGTCTA TTCGTTCATC AAGGGCGGTC AACGCATGTG GGGTGTCCTG
GCTCAAGAAG CCCAGGCAAA TCCCCTCACT GAGGTTCTAG TCAACGAAGG CGGGCAGCTT
CTACCAGACG GCAGCGGCAA CGCCCTGACC GTCGATAGCA TGGGCTACGT CTATGCCCTG
ATCGACACGG TCAAGGAGCA GAACGCTCGC ATTGCGGCGC TAGAAGCGAG GCTCGCATAA
 
Protein sequence
MTITTNEMLA DLGGRIVDLI QENLTPPQFI TALSGLATNW DGGTLSAVGL GRQLSDSIEA 
RNRWVDQASA YFQGTATGGP NSDGKYPFTT RTGATVSMEC PAKLAAMVTG PSESAQAYAA
SALAARDVIL GKVAEAAASA TAAAGSATAS AGSATAAAAS ATTAGTAKTA SETARDVTQG
YRDTTLTAKT ATETARDLTL TYRDGALTAK DAAVTAKTAS ESARDAAIAA ASSVDTTAIN
NNLALKFDKG GGTLTGDLYL TASGSQSPGF HLKANGMGTD AKISRAYSQG GLFTLDFVND
AYTAAQPFLT VGRSGTTPQA INLFGTSLSF NSTAITTATT LASGLATKQN TLGFTPVQQG
TGVGQTTNVI KLGWSNEGKL KATIDATDQG AIVFESALTW TNLSGKPSSF ASDWSTLTSK
PSTFAPSAHT HVIADTTGLQ AALDAKLATT GFTYTALPGK PSLYPTDTAN VSGLTAALAL
KADASALTSG LAAKAPIASP QFTGTANITG AAATTRLFGA QTAGVLRWMW GAAADTESGS
NAGSNWALYS YADNGAFIGT PISVTRASGA VTFAGAATFN STVSIGGATP WTSANFTPST
KLDTSAFTWA NLSSKPTTFA PSAHTHATSE ITGLDTALAG KAALSGATFT GAVAMNSTLA
VSGNVRAIST GGTGQFTAVS GNTMAGFYQD STNFYLLKSA TSNTTFDGHR PIVVSLSTGS
VTIDGTGAGG TQVGGTLGVT GTLNCAGEIY TPGWIRLTGN QGMYWNAWGG GWTMTDSTWM
RSYGDKSILT GGNIQCAMWT VTSDERLKTD IKPLTNGSEI IYGTNVYSFI KGGQRMWGVL
AQEAQANPLT EVLVNEGGQL LPDGSGNALT VDSMGYVYAL IDTVKEQNAR IAALEARLA