Gene Caul_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3901 
Symbol 
ID5901363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4220611 
End bp4224450 
Gene Length3840 bp 
Protein Length1279 aa 
Translation table11 
GC content75% 
IMG OID641564422 
Productgene transfer agent (GTA) orfg15, like protein 
Protein accessionYP_001685524 
Protein GI167647861 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.75252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0322249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGG TCATTCTCAC CGCCGTCGGC TCGGCCGTGG GCGGTCCGAT CGGCGCGGCG 
GTCGGCGCGG TGGTCGGCCG CGCCATCGAC AACGCGGCGA TCAACGCCCT GACCCCCGCC
CGCCAGGTCG GGCCGCGCAT CCCCGAGCTG CGCCTGTCGG GCGCCGCCGA GGGTGCGCCG
ATGGCCGCCG TGTTCGGCCG CGCGCGGGTG GCCGGCCAGG TGATCTGGGC GGCGCGGTTC
AAGGAGCGCT GGATCGACGG CAGGACCGGC GGGAGCAAGG GTCCGCGCAC CACCCGCGCC
GCCTACAGCC TGTCGTTCGC CGTGGCCGTC TGCGAAGGTC CGATCGACGG GATCGGCCGG
GTCTGGGCCG ACGGCAAGCC GATGGACATG GCCGGGGTGG TCATGCGGGT CCACACCGGC
GCCGAGGACC AGGCCCCCGA CGCCCTGATC GAGGCGGTGG AAGGAACCGC GCCGGCCTAT
CGCGGCACGG CCTATGTGGT GTTCGAGGAC CTGCCGCTGG GTCCCTATGG CGACCGCCCG
CCGCAGCTGT CGTTCGAGGT GTTCCACCGG CCGCGCGCCA GCGGCGCGAC GCCGGGTCTG
GAAGAGCGCC TGAAGGGCGT CTGCCTGATC CCGGGCGCGG GCGAGTTCGT CTACGCCACC
GACCTGGTGC TGCGCCGCGA CGGCCTGACC CGGACCACGG CCGAGACCCT GAACAACAGC
GAGGGCCGCC CCGACCTGGT CGTCTCGCTC GACCAGCTTC AGGCCCAGCT GCCGAACGTG
GAGGAGGTGA CCTTGGTGGT CGCTTGGTTC GGCGACGATC TGCGCTGCGG ATCGTGCGCG
ATCCGGCCCA AGGTCGAGCA GGCGGCCAAG GCGACGATCC CGTTCGACTG GCGGGTGAAC
GGCGTCGACC GCGTCCACGC GGCGGTGGTC TCCCAGCATG GCGGCGGTCC GGCCTATGGC
GGCACGCCGG CCGACCGGGC GGTGCTGCAG GCCATCGCCG AACTGAAGCG GCGCGGACTG
AAGGTGACGC TCTATCCGTT CGTGCTGATG GACGTGCCGG CTGGCAACGC TCTGCCCGAC
CCCTACGGCG GGTCGGCCCA GGGGGCCTAT CCGTGGCGCG GCCGGATCAC CTGCCACCCC
GCCGCTGGAC GGCCCGGCAC GCCGGACAAG ACCGCCGCCG CCACCGCCCA GGTCTCGGCC
TTGTTCGGCG CGGCCACGGC GACGCAGTTC GGGGCGGAGG GCGGCCTGCC GACCTACGGC
GGTCCGGCCG GCGACTGGGG GCTGAGGCGC ATGCTGCTGC ACTACGCCAA GCTGGCCCAA
CTGGCCGGCG GGGTGGACGG CTTCATCCTC GGTTCGGAGC TGCGGGGCCT GACCACGGTG
CGCGACGGGA CCTCGTCCTA TCCTGCCGTG ACCGCGCTGA AGGCCCTGGC CGGCCAGGTG
CGGACCTTGC TGGGGTCGAC GACCAAGCTG GGCTATGCCG CCGACTGGAG CGAGTATTTC
GGCCATCAGC CGGGCGACGG CTCCGGCGAC GTCCACTTCC ACCTCGACCC GCTGTGGAGC
GACGCCAACA TCGATTTCGT CGGGATCGAC TTCTATCCGC CGATGGCCGA CTGGCGGGAC
GGCGACGACC ACCTGGACGC CGGGCGAGGC GGTCCGCACG ACCTCGACTA CCTCCGCGCC
AACCTGGTAG GCGGCGAAGG CTTCGACTGG TTCTACGCCT CGGGAGCCGC GCGGACGGCG
CAGGTCCGCG CGCCGATCAC CGACGGCGCC CATGCCGAGC CCTGGGTGTT TCGGCCCAAG
GACCTGCAGG CCTGGTGGAG CCATGCCCAC TACAATCGCC CCGGCGGGGT GCGCGCCGCC
ACGCCGACCG CCTGGGTTCC GAGGTCCAAG CCGCTGCGGC TGGTCGAGTT CGGCTGCGGG
GCGGTCGACA AGGGCGCCAA CGCGCCCAAC CTGTTCGTCG ATGCGAAGAG CGCCGAGAGC
GCCCTGCCGC CGTCCTCGGA CGGGACGCGC GACGAGATCG GCCAGCGGCG GGCGCTGGAG
GCGGTGCTGG CCCAAGTGGC CGACCCGGCG ACCAACCCGG TCTCGCCGGT CTATGGCGGA
CCGATGATCG ACAGCGCGGC CGCCTGGTGC TGGGACGCCC GGCCGTTCCC CGACTTCCCG
GCCCGCGAGG CCGTCTGGGC CGACGGACCC AACTGGACCC TGGGGCACTG GCTGAACGGC
CGCGCTGGGA TCGCGCCGCT GCCGGAGCTG ATCGCCGCCT TGGCGCAGCG GGCCGGCGCG
ACGATCGATC CAGGCGAGGC CGGGGGCTCG GTGGTCGGCT ATGTGATCGA CCGGCCGATG
CGGCTGCGCG ACGCCCTGGC CCCGCTGCTG GAGGTCTTCG CCCTGGACGC GGTCGAGCGG
CAGGATGGCG TCGCCCTGGC CGGCCGCTCG GGCGTGGCGG TCCTGACCTT CGGCGACGAC
GACCTGGCCT GGCCGGACGA CCGTGACGCC CCGGTCCGCG CCAGCCGGAC CCTGGCCGCT
CCGGTCCAGG CCCTGCGCCT GCGCTTCATC GACGCCGCTC GCGACTACCA GACCGGTTCA
GTCATCGTCC GACGCGACGC GGGCGAGGGC GGCGCCGATC TCGACGCGCC GGTCGTGCTG
TCGGCCGCCG AGGCCCGCGC CGTGGCAGAG CGCCTGTTGG GGACGGGCGA CGGGCGCGAG
GTCACGGCCC ACCTGTCGCC CCTGGCCGCC CTGCGCCTGG AGCCCGGCGA CCGCCTGGCC
CTCGGCAGCG GCGTCTGGCG CGTGACGCGG ATCGATCTCG ACGAGCACCC CCGCGCCCAG
CTGGCGCCGG TGGTCGAGCC CGTGACGGTG GGCGGCGATT TGGACTGGTC GCCCGCCACG
CCGCGCGAGA TCCCCGGCCC GCCCGTGCTG CACGTGCTCG ACCTGCCCCT CCTTTCCGGC
TTGGGAGGGC AGGACGACGA CCGGCCGCTG GTCGCGGTCG CCGCCTCGCC CTGGCGGGCC
TTCGACGTCC AGGCGGGTGT CGGGCTGGAC GCGCTGCGGG GGCGGGCGAC CGCCGCCGTC
CCCGCCACGG TTGGCGTCAC CCTGTCGAAC CTGCCGGCCG GGCCGTTGCA CCGCTTCGAT
CGCGCCACCC GGCTGACCGT GCGGCTGGAG GGCGCCAGTC CGTCCGGCCG CGACCGGTTC
GCTGTCCTGG CCGGGGCCAA CGCCATCGCC GTGCGCGGGG CGGGCGGCGA GTGGGAGATC
CTCCAGTTCC TCGACGCCGA AGCGGTGTCG GGCGACGTCT GGACCCTTTC GGGCCTGCTG
CGCGGCCAGG CCGGCAGCGA CCCGGCCATG GCGGCCCTGA CCCCGGCCGG CGCGGCGGTG
GTGGTGCTGG ACGAGGCCCT GGTCCGGGCC GAGCTGACCC TGTCCGAGCG CGGCCTGCCG
CTGGTCTGGC GCGCCGCGCC GGCCGGCGGT CCCGCCTCGG GGCCTTCGAT GAGCGAGGTG
GTCGAGACCT GGCGCGGCCT CTCGACCCGG CCCTGGTCGC CCGCGCACCC GCGGGTGCGG
ACCCAGGGCG GCGATGCGGT GATCAGCTGG ATCCGCCGCG CCCGCCTGGC TGGAGACGGC
TGGGACGCCG AGGTTCCGCT GGGCGAGGAG CGCGAGGTCT ATCGCGTCGA GATCCTGGAC
GGCGAAACCG TGGTCCGCGC CGCCGAAACC AGCGTTCCGA CCTGGACTTA CACCGCCGCC
CAGCGCGCCG CCGACTTCCC CGCTGGACCA ACCGGGGTCT TGGCTGTCAG GATCGCCCAA
GGCTCGGCCC TGTTCGGCTG GGGGGCTTCG GCCCGCGTCC CCTTGGGAGT CTCGCTGTGA
 
Protein sequence
MAQVILTAVG SAVGGPIGAA VGAVVGRAID NAAINALTPA RQVGPRIPEL RLSGAAEGAP 
MAAVFGRARV AGQVIWAARF KERWIDGRTG GSKGPRTTRA AYSLSFAVAV CEGPIDGIGR
VWADGKPMDM AGVVMRVHTG AEDQAPDALI EAVEGTAPAY RGTAYVVFED LPLGPYGDRP
PQLSFEVFHR PRASGATPGL EERLKGVCLI PGAGEFVYAT DLVLRRDGLT RTTAETLNNS
EGRPDLVVSL DQLQAQLPNV EEVTLVVAWF GDDLRCGSCA IRPKVEQAAK ATIPFDWRVN
GVDRVHAAVV SQHGGGPAYG GTPADRAVLQ AIAELKRRGL KVTLYPFVLM DVPAGNALPD
PYGGSAQGAY PWRGRITCHP AAGRPGTPDK TAAATAQVSA LFGAATATQF GAEGGLPTYG
GPAGDWGLRR MLLHYAKLAQ LAGGVDGFIL GSELRGLTTV RDGTSSYPAV TALKALAGQV
RTLLGSTTKL GYAADWSEYF GHQPGDGSGD VHFHLDPLWS DANIDFVGID FYPPMADWRD
GDDHLDAGRG GPHDLDYLRA NLVGGEGFDW FYASGAARTA QVRAPITDGA HAEPWVFRPK
DLQAWWSHAH YNRPGGVRAA TPTAWVPRSK PLRLVEFGCG AVDKGANAPN LFVDAKSAES
ALPPSSDGTR DEIGQRRALE AVLAQVADPA TNPVSPVYGG PMIDSAAAWC WDARPFPDFP
AREAVWADGP NWTLGHWLNG RAGIAPLPEL IAALAQRAGA TIDPGEAGGS VVGYVIDRPM
RLRDALAPLL EVFALDAVER QDGVALAGRS GVAVLTFGDD DLAWPDDRDA PVRASRTLAA
PVQALRLRFI DAARDYQTGS VIVRRDAGEG GADLDAPVVL SAAEARAVAE RLLGTGDGRE
VTAHLSPLAA LRLEPGDRLA LGSGVWRVTR IDLDEHPRAQ LAPVVEPVTV GGDLDWSPAT
PREIPGPPVL HVLDLPLLSG LGGQDDDRPL VAVAASPWRA FDVQAGVGLD ALRGRATAAV
PATVGVTLSN LPAGPLHRFD RATRLTVRLE GASPSGRDRF AVLAGANAIA VRGAGGEWEI
LQFLDAEAVS GDVWTLSGLL RGQAGSDPAM AALTPAGAAV VVLDEALVRA ELTLSERGLP
LVWRAAPAGG PASGPSMSEV VETWRGLSTR PWSPAHPRVR TQGGDAVISW IRRARLAGDG
WDAEVPLGEE REVYRVEILD GETVVRAAET SVPTWTYTAA QRAADFPAGP TGVLAVRIAQ
GSALFGWGAS ARVPLGVSL