Gene Caul_4077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4077 
Symbol 
ID5901539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4417899 
End bp4420778 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content69% 
IMG OID641564598 
Productcyclic nucleotide-binding protein 
Protein accessionYP_001685700 
Protein GI167648037 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGA CCAACGACAA GGGGGCTCTC CTAATGATCA AGACCTTGGG TCTGCGCATC 
GGCGCATGCG CTTTGCTGGC CACCACGGCC CTGGTTTCCA TCGTCCCGGC GACGGCCCAG
GCTCAGGCCG CCGCGCAGGT GACCTTTGAC ATCCCGGCCG GCGACACCGC CACGGCCCTG
AACGCCTTCT CGCGCCAGGC CGGCGTGCAA CTGATGTTCC CCTATGACGT GGCCGCTCGT
CACCGCACGG CGGGGCTGAA GGGCGCCTAT GGCCGCGAGG AGGCCCTGCG CCGGCTGATC
GACGGCGGCG AGCTGGAGAT CGCCTCGGCC ACCGCCCAGG TCATCACCCT GCGCGAGAAG
AGCGCGGGCC CTTTGGCCGC CGCGCCAGCT GACATGGTCG GCGAGATCAT CGTCACCTCG
CGCGCCGGCT CGGACGTCCG CACCCGGCTG GAGACCAGCT ACGCCGTCAC CACCATGAGC
GCGGAGTCGC TGCGCCTGCG CTCGCCGATG GGCGTGGCCG ACGCGCTGAA GGCCGTGCCC
GGCTTCTGGG TCGAGGCCTC GGGCGGCGAA GCCAGCGCCA ACATCCGGGC GCGCGGCATC
CCGCAGGAAG GCTATTCGGC CATCGCCCTG CAGGAGGACG GCGCGCCGAT CCAGCACGAC
GGCGGCCTGG GCTACCTCAA CGCCGACCAG TCGTTCCGCC TGGACGAGAC CATCGATCGC
ATCGAAGTGG TGCGCGGCGG CCCCTCGTCG ATCTTCGCCT CCTACGCCCC CGGCGGTACG
GTCAACTTCA TCACCCGCAA GGCGACCGAC ACGCCCGAGG GCCTGGCCAA GGTCCAGGTC
AGCGACTACG GGACCAAGCG CGTCGATCTG TTCTACGGCG GCCCGGTCGG CGGCGGCTGG
CACGCCTCGG TCGGCGGCTT CTGGCGTGAA GAGGACGGCA TCCGCGACCC GGGCTTCACG
GCCAACAAGG GCTATCAATG GCGCGTCGAG GCCGGCCGCG CCTTCGAGCG CGGCAGCATC
GAGTTCAACC TCAAGCACCT GGACGACAAC GTCATCCTGT TCGCCGGCGT GCCGGTGAAA
TTCAACGCCG CGGGCGAGCC CAGCGCCGCC CCCGGGTTCG ATCCGCTGAC CGGCACCCTG
GCCGGCCCGG AAACCGCGCA TCTGACCCTG CGCGGCCCCA CCGGCCCGTT CAACTGGGAC
CTGACGCGCG GCACCGAGGT CGAGCTGACC CAGGCCACGG CGGTGTTCAA GTACGAGCCG
TTCGACGGCT GGCATTTCCA GGACACGCTG CGCTACCGCA CCTCGGAGTC CAAGCGCATC
GGCCTGTTCC CCAACACCCC GGTGCTGGGA ACGAAGCGGA TTGACCAGGT GACGACCGAC
TTCCTGAAGG CCAACAACAA CGCCGCCAGC CTGGGTAACG CCGCCAGCCT GGGTCAGGTC
ATTCCGGGCG CGGTCGGGCT GCAACTGCGC TACAGCACCA CGGGCGAGGT CTTCAACACC
GCCGGCGCCG GCCAGAACGG CAACGGCCTC GTGCTCGACG GCTCGCTGCG CTACGTCTCC
GTGCCGCTGG ACGAGCTGAT CAACGACGCC CGGGTGCTGC ACAAGTTCGA GATGGGCGAC
CAGACCCACG ACGTGGCCTT CGGCGTCTAC ACCGCCCACG TCGAGGAAGC GTTCAACCGC
TACTCGGCCA ACACCCTGCT CGACGTGCAG AGCAACGCCC GCCGCCTCGA CCTGGTGGCC
GTCGACGCCA GCGGCAAGGC GCTCTACAGC TTCACCGAGA ACGGCGTCAG CCGCTACGGC
GCCGAGTTCG CCAACGGCGA CGGCAAGTCC AACACCCTCG CCCTGTACCT GACCGACGAG
TGGAGCATCA CGCCGAAGCT GCGTGTCGAC GCCGGCGTGC GCTGGGAGAA GATCGAGTTC
GAAGGCCGCA GCGAGCGCAG CGCGACGAAG AACCTGGGCC AATCCCCGAC CCCGGCCGAC
GACACGGTGC TGTCCGGCAC GGGCGTCTAC GACCATCTGG ACCGCAAGTT CCAGCACGCC
GGCTGGACCC TCGGCGTCGA CTACAAGATC ACCGGCCAGA TGGGCATGTT CGGCCGCTTC
ACCTCGGCCT TCCGCCTGCC GTCGCTGGGC GACTACATCA CCAACGCCAC CAACTCGACG
GGCACGGTCC AGACCATGGA TCTGTCGGAG CTGGGCTTCA AATGGGTGAC CCCGAAGATC
GAGCTCTACG CCACGGCCTT CCAGACCACC TATGACAACC TCGGCTTCGG CAGCCTGGTG
TTCAACCCGA CCACCGGCGC CTACGTCAAC CAGACCAGCG TCACCGACAC CAAGACCCTG
GGCCTGGAGC TGGAAGGCAC GGTGCGACCG GTCTCGTGGT TCGACCTGCG TCTCAGCGCC
ACGTTCCAGA ACCCGGAATT TGGCGACTAC AAGTTCGAGG AGAACTGCAC GGTCGCGGCG
ACCGACCCGA CCTGCCAGGT CAAGCCGCCG GCGGCCACCG CCAGCCGCAC CCGCGACTTC
ACCGGCAACC AGCTGATCCG TGTGCCCAAG ACCTCGTTCC GCCTGACGCC CGGCCTCAAC
CTGCTGGACA GCAAGCTGCG GCTGGAGGCC AGCGTCGAGC GCTACGATGA TCGCTATTCC
GACGCCGCCA ACACCTCCAA GCTGCCGGCC TACACCCTGG TCGGCGCCAC GGTGCGCTAC
CAGATCACCG ACGGCCTGAC GGTCTATGCC TATGGCGCCA ACCTGTTCAA CGAGCTGGGC
CTGACCGAGG GCAACCCCCG GGCCGGCCAG ATCGTCAGCG GCGAGGCCGG GTCCCTGTAC
GGCATCGGCC GGCCGGAGTT CGGCCGCTCG TTCCGCGCGG CGCTGATGTA CCGGTTTTAG
 
Protein sequence
MTATNDKGAL LMIKTLGLRI GACALLATTA LVSIVPATAQ AQAAAQVTFD IPAGDTATAL 
NAFSRQAGVQ LMFPYDVAAR HRTAGLKGAY GREEALRRLI DGGELEIASA TAQVITLREK
SAGPLAAAPA DMVGEIIVTS RAGSDVRTRL ETSYAVTTMS AESLRLRSPM GVADALKAVP
GFWVEASGGE ASANIRARGI PQEGYSAIAL QEDGAPIQHD GGLGYLNADQ SFRLDETIDR
IEVVRGGPSS IFASYAPGGT VNFITRKATD TPEGLAKVQV SDYGTKRVDL FYGGPVGGGW
HASVGGFWRE EDGIRDPGFT ANKGYQWRVE AGRAFERGSI EFNLKHLDDN VILFAGVPVK
FNAAGEPSAA PGFDPLTGTL AGPETAHLTL RGPTGPFNWD LTRGTEVELT QATAVFKYEP
FDGWHFQDTL RYRTSESKRI GLFPNTPVLG TKRIDQVTTD FLKANNNAAS LGNAASLGQV
IPGAVGLQLR YSTTGEVFNT AGAGQNGNGL VLDGSLRYVS VPLDELINDA RVLHKFEMGD
QTHDVAFGVY TAHVEEAFNR YSANTLLDVQ SNARRLDLVA VDASGKALYS FTENGVSRYG
AEFANGDGKS NTLALYLTDE WSITPKLRVD AGVRWEKIEF EGRSERSATK NLGQSPTPAD
DTVLSGTGVY DHLDRKFQHA GWTLGVDYKI TGQMGMFGRF TSAFRLPSLG DYITNATNST
GTVQTMDLSE LGFKWVTPKI ELYATAFQTT YDNLGFGSLV FNPTTGAYVN QTSVTDTKTL
GLELEGTVRP VSWFDLRLSA TFQNPEFGDY KFEENCTVAA TDPTCQVKPP AATASRTRDF
TGNQLIRVPK TSFRLTPGLN LLDSKLRLEA SVERYDDRYS DAANTSKLPA YTLVGATVRY
QITDGLTVYA YGANLFNELG LTEGNPRAGQ IVSGEAGSLY GIGRPEFGRS FRAALMYRF