Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3901 |
Symbol | |
ID | 5901363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4220611 |
End bp | 4224450 |
Gene Length | 3840 bp |
Protein Length | 1279 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641564422 |
Product | gene transfer agent (GTA) orfg15, like protein |
Protein accession | YP_001685524 |
Protein GI | 167647861 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.75252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0322249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAGG TCATTCTCAC CGCCGTCGGC TCGGCCGTGG GCGGTCCGAT CGGCGCGGCG GTCGGCGCGG TGGTCGGCCG CGCCATCGAC AACGCGGCGA TCAACGCCCT GACCCCCGCC CGCCAGGTCG GGCCGCGCAT CCCCGAGCTG CGCCTGTCGG GCGCCGCCGA GGGTGCGCCG ATGGCCGCCG TGTTCGGCCG CGCGCGGGTG GCCGGCCAGG TGATCTGGGC GGCGCGGTTC AAGGAGCGCT GGATCGACGG CAGGACCGGC GGGAGCAAGG GTCCGCGCAC CACCCGCGCC GCCTACAGCC TGTCGTTCGC CGTGGCCGTC TGCGAAGGTC CGATCGACGG GATCGGCCGG GTCTGGGCCG ACGGCAAGCC GATGGACATG GCCGGGGTGG TCATGCGGGT CCACACCGGC GCCGAGGACC AGGCCCCCGA CGCCCTGATC GAGGCGGTGG AAGGAACCGC GCCGGCCTAT CGCGGCACGG CCTATGTGGT GTTCGAGGAC CTGCCGCTGG GTCCCTATGG CGACCGCCCG CCGCAGCTGT CGTTCGAGGT GTTCCACCGG CCGCGCGCCA GCGGCGCGAC GCCGGGTCTG GAAGAGCGCC TGAAGGGCGT CTGCCTGATC CCGGGCGCGG GCGAGTTCGT CTACGCCACC GACCTGGTGC TGCGCCGCGA CGGCCTGACC CGGACCACGG CCGAGACCCT GAACAACAGC GAGGGCCGCC CCGACCTGGT CGTCTCGCTC GACCAGCTTC AGGCCCAGCT GCCGAACGTG GAGGAGGTGA CCTTGGTGGT CGCTTGGTTC GGCGACGATC TGCGCTGCGG ATCGTGCGCG ATCCGGCCCA AGGTCGAGCA GGCGGCCAAG GCGACGATCC CGTTCGACTG GCGGGTGAAC GGCGTCGACC GCGTCCACGC GGCGGTGGTC TCCCAGCATG GCGGCGGTCC GGCCTATGGC GGCACGCCGG CCGACCGGGC GGTGCTGCAG GCCATCGCCG AACTGAAGCG GCGCGGACTG AAGGTGACGC TCTATCCGTT CGTGCTGATG GACGTGCCGG CTGGCAACGC TCTGCCCGAC CCCTACGGCG GGTCGGCCCA GGGGGCCTAT CCGTGGCGCG GCCGGATCAC CTGCCACCCC GCCGCTGGAC GGCCCGGCAC GCCGGACAAG ACCGCCGCCG CCACCGCCCA GGTCTCGGCC TTGTTCGGCG CGGCCACGGC GACGCAGTTC GGGGCGGAGG GCGGCCTGCC GACCTACGGC GGTCCGGCCG GCGACTGGGG GCTGAGGCGC ATGCTGCTGC ACTACGCCAA GCTGGCCCAA CTGGCCGGCG GGGTGGACGG CTTCATCCTC GGTTCGGAGC TGCGGGGCCT GACCACGGTG CGCGACGGGA CCTCGTCCTA TCCTGCCGTG ACCGCGCTGA AGGCCCTGGC CGGCCAGGTG CGGACCTTGC TGGGGTCGAC GACCAAGCTG GGCTATGCCG CCGACTGGAG CGAGTATTTC GGCCATCAGC CGGGCGACGG CTCCGGCGAC GTCCACTTCC ACCTCGACCC GCTGTGGAGC GACGCCAACA TCGATTTCGT CGGGATCGAC TTCTATCCGC CGATGGCCGA CTGGCGGGAC GGCGACGACC ACCTGGACGC CGGGCGAGGC GGTCCGCACG ACCTCGACTA CCTCCGCGCC AACCTGGTAG GCGGCGAAGG CTTCGACTGG TTCTACGCCT CGGGAGCCGC GCGGACGGCG CAGGTCCGCG CGCCGATCAC CGACGGCGCC CATGCCGAGC CCTGGGTGTT TCGGCCCAAG GACCTGCAGG CCTGGTGGAG CCATGCCCAC TACAATCGCC CCGGCGGGGT GCGCGCCGCC ACGCCGACCG CCTGGGTTCC GAGGTCCAAG CCGCTGCGGC TGGTCGAGTT CGGCTGCGGG GCGGTCGACA AGGGCGCCAA CGCGCCCAAC CTGTTCGTCG ATGCGAAGAG CGCCGAGAGC GCCCTGCCGC CGTCCTCGGA CGGGACGCGC GACGAGATCG GCCAGCGGCG GGCGCTGGAG GCGGTGCTGG CCCAAGTGGC CGACCCGGCG ACCAACCCGG TCTCGCCGGT CTATGGCGGA CCGATGATCG ACAGCGCGGC CGCCTGGTGC TGGGACGCCC GGCCGTTCCC CGACTTCCCG GCCCGCGAGG CCGTCTGGGC CGACGGACCC AACTGGACCC TGGGGCACTG GCTGAACGGC CGCGCTGGGA TCGCGCCGCT GCCGGAGCTG ATCGCCGCCT TGGCGCAGCG GGCCGGCGCG ACGATCGATC CAGGCGAGGC CGGGGGCTCG GTGGTCGGCT ATGTGATCGA CCGGCCGATG CGGCTGCGCG ACGCCCTGGC CCCGCTGCTG GAGGTCTTCG CCCTGGACGC GGTCGAGCGG CAGGATGGCG TCGCCCTGGC CGGCCGCTCG GGCGTGGCGG TCCTGACCTT CGGCGACGAC GACCTGGCCT GGCCGGACGA CCGTGACGCC CCGGTCCGCG CCAGCCGGAC CCTGGCCGCT CCGGTCCAGG CCCTGCGCCT GCGCTTCATC GACGCCGCTC GCGACTACCA GACCGGTTCA GTCATCGTCC GACGCGACGC GGGCGAGGGC GGCGCCGATC TCGACGCGCC GGTCGTGCTG TCGGCCGCCG AGGCCCGCGC CGTGGCAGAG CGCCTGTTGG GGACGGGCGA CGGGCGCGAG GTCACGGCCC ACCTGTCGCC CCTGGCCGCC CTGCGCCTGG AGCCCGGCGA CCGCCTGGCC CTCGGCAGCG GCGTCTGGCG CGTGACGCGG ATCGATCTCG ACGAGCACCC CCGCGCCCAG CTGGCGCCGG TGGTCGAGCC CGTGACGGTG GGCGGCGATT TGGACTGGTC GCCCGCCACG CCGCGCGAGA TCCCCGGCCC GCCCGTGCTG CACGTGCTCG ACCTGCCCCT CCTTTCCGGC TTGGGAGGGC AGGACGACGA CCGGCCGCTG GTCGCGGTCG CCGCCTCGCC CTGGCGGGCC TTCGACGTCC AGGCGGGTGT CGGGCTGGAC GCGCTGCGGG GGCGGGCGAC CGCCGCCGTC CCCGCCACGG TTGGCGTCAC CCTGTCGAAC CTGCCGGCCG GGCCGTTGCA CCGCTTCGAT CGCGCCACCC GGCTGACCGT GCGGCTGGAG GGCGCCAGTC CGTCCGGCCG CGACCGGTTC GCTGTCCTGG CCGGGGCCAA CGCCATCGCC GTGCGCGGGG CGGGCGGCGA GTGGGAGATC CTCCAGTTCC TCGACGCCGA AGCGGTGTCG GGCGACGTCT GGACCCTTTC GGGCCTGCTG CGCGGCCAGG CCGGCAGCGA CCCGGCCATG GCGGCCCTGA CCCCGGCCGG CGCGGCGGTG GTGGTGCTGG ACGAGGCCCT GGTCCGGGCC GAGCTGACCC TGTCCGAGCG CGGCCTGCCG CTGGTCTGGC GCGCCGCGCC GGCCGGCGGT CCCGCCTCGG GGCCTTCGAT GAGCGAGGTG GTCGAGACCT GGCGCGGCCT CTCGACCCGG CCCTGGTCGC CCGCGCACCC GCGGGTGCGG ACCCAGGGCG GCGATGCGGT GATCAGCTGG ATCCGCCGCG CCCGCCTGGC TGGAGACGGC TGGGACGCCG AGGTTCCGCT GGGCGAGGAG CGCGAGGTCT ATCGCGTCGA GATCCTGGAC GGCGAAACCG TGGTCCGCGC CGCCGAAACC AGCGTTCCGA CCTGGACTTA CACCGCCGCC CAGCGCGCCG CCGACTTCCC CGCTGGACCA ACCGGGGTCT TGGCTGTCAG GATCGCCCAA GGCTCGGCCC TGTTCGGCTG GGGGGCTTCG GCCCGCGTCC CCTTGGGAGT CTCGCTGTGA
|
Protein sequence | MAQVILTAVG SAVGGPIGAA VGAVVGRAID NAAINALTPA RQVGPRIPEL RLSGAAEGAP MAAVFGRARV AGQVIWAARF KERWIDGRTG GSKGPRTTRA AYSLSFAVAV CEGPIDGIGR VWADGKPMDM AGVVMRVHTG AEDQAPDALI EAVEGTAPAY RGTAYVVFED LPLGPYGDRP PQLSFEVFHR PRASGATPGL EERLKGVCLI PGAGEFVYAT DLVLRRDGLT RTTAETLNNS EGRPDLVVSL DQLQAQLPNV EEVTLVVAWF GDDLRCGSCA IRPKVEQAAK ATIPFDWRVN GVDRVHAAVV SQHGGGPAYG GTPADRAVLQ AIAELKRRGL KVTLYPFVLM DVPAGNALPD PYGGSAQGAY PWRGRITCHP AAGRPGTPDK TAAATAQVSA LFGAATATQF GAEGGLPTYG GPAGDWGLRR MLLHYAKLAQ LAGGVDGFIL GSELRGLTTV RDGTSSYPAV TALKALAGQV RTLLGSTTKL GYAADWSEYF GHQPGDGSGD VHFHLDPLWS DANIDFVGID FYPPMADWRD GDDHLDAGRG GPHDLDYLRA NLVGGEGFDW FYASGAARTA QVRAPITDGA HAEPWVFRPK DLQAWWSHAH YNRPGGVRAA TPTAWVPRSK PLRLVEFGCG AVDKGANAPN LFVDAKSAES ALPPSSDGTR DEIGQRRALE AVLAQVADPA TNPVSPVYGG PMIDSAAAWC WDARPFPDFP AREAVWADGP NWTLGHWLNG RAGIAPLPEL IAALAQRAGA TIDPGEAGGS VVGYVIDRPM RLRDALAPLL EVFALDAVER QDGVALAGRS GVAVLTFGDD DLAWPDDRDA PVRASRTLAA PVQALRLRFI DAARDYQTGS VIVRRDAGEG GADLDAPVVL SAAEARAVAE RLLGTGDGRE VTAHLSPLAA LRLEPGDRLA LGSGVWRVTR IDLDEHPRAQ LAPVVEPVTV GGDLDWSPAT PREIPGPPVL HVLDLPLLSG LGGQDDDRPL VAVAASPWRA FDVQAGVGLD ALRGRATAAV PATVGVTLSN LPAGPLHRFD RATRLTVRLE GASPSGRDRF AVLAGANAIA VRGAGGEWEI LQFLDAEAVS GDVWTLSGLL RGQAGSDPAM AALTPAGAAV VVLDEALVRA ELTLSERGLP LVWRAAPAGG PASGPSMSEV VETWRGLSTR PWSPAHPRVR TQGGDAVISW IRRARLAGDG WDAEVPLGEE REVYRVEILD GETVVRAAET SVPTWTYTAA QRAADFPAGP TGVLAVRIAQ GSALFGWGAS ARVPLGVSL
|
| |