Gene Information |
Locus tag | Caul_4077 |
Symbol | |
ID | 5901539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4417899 |
End bp | 4420778 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564598 |
Product | cyclic nucleotide-binding protein |
Protein accession | YP_001685700 |
Protein GI | 167648037 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
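The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a consistency check and assumes that Start bp and End bp are 1-based and inclusive, which agrees with the listed gene length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 4417899
end_bp = 4420778

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in codons of three bases; the last codon is the stop
# codon and does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2880 0 959
```

Under that assumption the result matches the Gene Length (2880 bp) and Protein Length (959 aa) fields.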
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGGCGA CCAACGACAA GGGGGCTCTC CTAATGATCA AGACCTTGGG TCTGCGCATC GGCGCATGCG CTTTGCTGGC CACCACGGCC CTGGTTTCCA TCGTCCCGGC GACGGCCCAG GCTCAGGCCG CCGCGCAGGT GACCTTTGAC ATCCCGGCCG GCGACACCGC CACGGCCCTG AACGCCTTCT CGCGCCAGGC CGGCGTGCAA CTGATGTTCC CCTATGACGT GGCCGCTCGT CACCGCACGG CGGGGCTGAA GGGCGCCTAT GGCCGCGAGG AGGCCCTGCG CCGGCTGATC GACGGCGGCG AGCTGGAGAT CGCCTCGGCC ACCGCCCAGG TCATCACCCT GCGCGAGAAG AGCGCGGGCC CTTTGGCCGC CGCGCCAGCT GACATGGTCG GCGAGATCAT CGTCACCTCG CGCGCCGGCT CGGACGTCCG CACCCGGCTG GAGACCAGCT ACGCCGTCAC CACCATGAGC GCGGAGTCGC TGCGCCTGCG CTCGCCGATG GGCGTGGCCG ACGCGCTGAA GGCCGTGCCC GGCTTCTGGG TCGAGGCCTC GGGCGGCGAA GCCAGCGCCA ACATCCGGGC GCGCGGCATC CCGCAGGAAG GCTATTCGGC CATCGCCCTG CAGGAGGACG GCGCGCCGAT CCAGCACGAC GGCGGCCTGG GCTACCTCAA CGCCGACCAG TCGTTCCGCC TGGACGAGAC CATCGATCGC ATCGAAGTGG TGCGCGGCGG CCCCTCGTCG ATCTTCGCCT CCTACGCCCC CGGCGGTACG GTCAACTTCA TCACCCGCAA GGCGACCGAC ACGCCCGAGG GCCTGGCCAA GGTCCAGGTC AGCGACTACG GGACCAAGCG CGTCGATCTG TTCTACGGCG GCCCGGTCGG CGGCGGCTGG CACGCCTCGG TCGGCGGCTT CTGGCGTGAA GAGGACGGCA TCCGCGACCC GGGCTTCACG GCCAACAAGG GCTATCAATG GCGCGTCGAG GCCGGCCGCG CCTTCGAGCG CGGCAGCATC GAGTTCAACC TCAAGCACCT GGACGACAAC GTCATCCTGT TCGCCGGCGT GCCGGTGAAA TTCAACGCCG CGGGCGAGCC CAGCGCCGCC CCCGGGTTCG ATCCGCTGAC CGGCACCCTG GCCGGCCCGG AAACCGCGCA TCTGACCCTG CGCGGCCCCA CCGGCCCGTT CAACTGGGAC CTGACGCGCG GCACCGAGGT CGAGCTGACC CAGGCCACGG CGGTGTTCAA GTACGAGCCG TTCGACGGCT GGCATTTCCA GGACACGCTG CGCTACCGCA CCTCGGAGTC CAAGCGCATC GGCCTGTTCC CCAACACCCC GGTGCTGGGA ACGAAGCGGA TTGACCAGGT GACGACCGAC TTCCTGAAGG CCAACAACAA CGCCGCCAGC CTGGGTAACG CCGCCAGCCT GGGTCAGGTC ATTCCGGGCG CGGTCGGGCT GCAACTGCGC TACAGCACCA CGGGCGAGGT CTTCAACACC GCCGGCGCCG GCCAGAACGG CAACGGCCTC GTGCTCGACG GCTCGCTGCG CTACGTCTCC GTGCCGCTGG ACGAGCTGAT CAACGACGCC CGGGTGCTGC ACAAGTTCGA GATGGGCGAC CAGACCCACG ACGTGGCCTT CGGCGTCTAC ACCGCCCACG TCGAGGAAGC GTTCAACCGC TACTCGGCCA ACACCCTGCT CGACGTGCAG AGCAACGCCC GCCGCCTCGA CCTGGTGGCC GTCGACGCCA GCGGCAAGGC GCTCTACAGC TTCACCGAGA ACGGCGTCAG CCGCTACGGC 
GCCGAGTTCG CCAACGGCGA CGGCAAGTCC AACACCCTCG CCCTGTACCT GACCGACGAG TGGAGCATCA CGCCGAAGCT GCGTGTCGAC GCCGGCGTGC GCTGGGAGAA GATCGAGTTC GAAGGCCGCA GCGAGCGCAG CGCGACGAAG AACCTGGGCC AATCCCCGAC CCCGGCCGAC GACACGGTGC TGTCCGGCAC GGGCGTCTAC GACCATCTGG ACCGCAAGTT CCAGCACGCC GGCTGGACCC TCGGCGTCGA CTACAAGATC ACCGGCCAGA TGGGCATGTT CGGCCGCTTC ACCTCGGCCT TCCGCCTGCC GTCGCTGGGC GACTACATCA CCAACGCCAC CAACTCGACG GGCACGGTCC AGACCATGGA TCTGTCGGAG CTGGGCTTCA AATGGGTGAC CCCGAAGATC GAGCTCTACG CCACGGCCTT CCAGACCACC TATGACAACC TCGGCTTCGG CAGCCTGGTG TTCAACCCGA CCACCGGCGC CTACGTCAAC CAGACCAGCG TCACCGACAC CAAGACCCTG GGCCTGGAGC TGGAAGGCAC GGTGCGACCG GTCTCGTGGT TCGACCTGCG TCTCAGCGCC ACGTTCCAGA ACCCGGAATT TGGCGACTAC AAGTTCGAGG AGAACTGCAC GGTCGCGGCG ACCGACCCGA CCTGCCAGGT CAAGCCGCCG GCGGCCACCG CCAGCCGCAC CCGCGACTTC ACCGGCAACC AGCTGATCCG TGTGCCCAAG ACCTCGTTCC GCCTGACGCC CGGCCTCAAC CTGCTGGACA GCAAGCTGCG GCTGGAGGCC AGCGTCGAGC GCTACGATGA TCGCTATTCC GACGCCGCCA ACACCTCCAA GCTGCCGGCC TACACCCTGG TCGGCGCCAC GGTGCGCTAC CAGATCACCG ACGGCCTGAC GGTCTATGCC TATGGCGCCA ACCTGTTCAA CGAGCTGGGC CTGACCGAGG GCAACCCCCG GGCCGGCCAG ATCGTCAGCG GCGAGGCCGG GTCCCTGTAC GGCATCGGCC GGCCGGAGTT CGGCCGCTCG TTCCGCGCGG CGCTGATGTA CCGGTTTTAG
Protein sequence | MTATNDKGAL LMIKTLGLRI GACALLATTA LVSIVPATAQ AQAAAQVTFD IPAGDTATAL NAFSRQAGVQ LMFPYDVAAR HRTAGLKGAY GREEALRRLI DGGELEIASA TAQVITLREK SAGPLAAAPA DMVGEIIVTS RAGSDVRTRL ETSYAVTTMS AESLRLRSPM GVADALKAVP GFWVEASGGE ASANIRARGI PQEGYSAIAL QEDGAPIQHD GGLGYLNADQ SFRLDETIDR IEVVRGGPSS IFASYAPGGT VNFITRKATD TPEGLAKVQV SDYGTKRVDL FYGGPVGGGW HASVGGFWRE EDGIRDPGFT ANKGYQWRVE AGRAFERGSI EFNLKHLDDN VILFAGVPVK FNAAGEPSAA PGFDPLTGTL AGPETAHLTL RGPTGPFNWD LTRGTEVELT QATAVFKYEP FDGWHFQDTL RYRTSESKRI GLFPNTPVLG TKRIDQVTTD FLKANNNAAS LGNAASLGQV IPGAVGLQLR YSTTGEVFNT AGAGQNGNGL VLDGSLRYVS VPLDELINDA RVLHKFEMGD QTHDVAFGVY TAHVEEAFNR YSANTLLDVQ SNARRLDLVA VDASGKALYS FTENGVSRYG AEFANGDGKS NTLALYLTDE WSITPKLRVD AGVRWEKIEF EGRSERSATK NLGQSPTPAD DTVLSGTGVY DHLDRKFQHA GWTLGVDYKI TGQMGMFGRF TSAFRLPSLG DYITNATNST GTVQTMDLSE LGFKWVTPKI ELYATAFQTT YDNLGFGSLV FNPTTGAYVN QTSVTDTKTL GLELEGTVRP VSWFDLRLSA TFQNPEFGDY KFEENCTVAA TDPTCQVKPP AATASRTRDF TGNQLIRVPK TSFRLTPGLN LLDSKLRLEA SVERYDDRYS DAANTSKLPA YTLVGATVRY QITDGLTVYA YGANLFNELG LTEGNPRAGQ IVSGEAGSLY GIGRPEFGRS FRAALMYRF
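Both sequences are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the gene sequence. The helper names (clean, gc_fraction, translate) are hypothetical. It relies on the fact that translation table 11 assigns amino acids exactly as the standard genetic code does (the two differ only in which codons may start translation). The gene sequence begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand; no reverse complement is taken here.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
fragment = "ATGACGGCGA CCAACGACAA GGGGGCTCTC"
print(translate(fragment))  # MTATNDKGAL
```

The translated fragment matches the first block of the protein sequence. Applying gc_fraction to the full gene sequence would give a value to compare with the GC content field.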