Gene Caul_2996 details

Gene Information

Locus tag: Caul_2996
Symbol:
ID: 5900451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3257580
End bp: 3259328
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 71%
IMG OID: 641563493
Product: acid phosphatase
Protein accession: YP_001684621
Protein GI: 167646958
COG category:
COG ID:
TIGRFAM ID:
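
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3257580
END_BP = 3259328


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1749
print(protein_length(length_bp))  # 582
```

Under these assumptions, both results agree with the Gene Length (1749 bp) and Protein Length (582 aa) fields in the record.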


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAGCC TGCTGCTGGC CCTCGTGGCG GCCGATCCGA TCGCCGCGAC GGTGCGGATG 
AATGACCTGC TCGCGGTCGG CACGCACAAT TCCTACAAGC AGGCCGTTCC ACCCGAGGAG
ATGGCGGCCA TGGTCGCCGC CACGGGGCAG GGGGCGCTGG CCCTCGACTA TGGCCATCGC
CCGCTGGCCG AGGAACTGGA CGCCGGCGCT CGCCAACTGG AGCTGGACGT GGTCCGCGAT
CCCGAGGGCG GTCGCTTCGC CAAGCCCGCC ACCGCGTTCG GCAAGGGCGT CGTCCCGACC
CCGGCCTGGG CGGCGGCCAT GGCCAAGCCG GGCTACAAGG TCCTGCACAT GCAGGACGTC
GACTTCCGCT CGACCTGCTG GACCTTCGTC GCCTGCCTGA CGCAGATCGG GACTTGGTCC
AAGGCCCATC CCGACCACGC CCCGATTCTG ATCCTGCTGA ACGCCAAGGA CGGTCCGTCC
AGCTTTCCAA ACGGCGTCGA GGCCCTGCCC TATGACGAGA AGGCCTTCGA CGCCCTGGAC
GGCGAGATCC GTTCGGTGTT CGCCGAGAGC CAGTTGATCA CGCCGGACCA GGTTCGGGGC
GGGCGCGCGA CCCTGCGCGA GGCGGTGCTG GCTGGCGGCT GGCCGACGCT GAACGCGGCG
CGCGGCAAGG TGTTCTTCGC CCTTGACGAG AGCCCCGAGA AAGTCGCCGC CTATCGGGGC
GCGCGCAAGT CGCTGGAAGG TCGGGCGATG TTCGTCAACA CCGACGAGGC CTCGCCGGCC
GCCGCCTATC TTACCCTCAA CGATCCGATC GGCCAGCGCG AGCGCATCGC CGTCGCCGTG
AAGGCCGGCT TCATTGTTCG CACCCGCGCC GACGCCGACA CCTGGCAGGC GCGCAAAAAC
GACGTGGCGG TGCGCACCGC CGCGCTCTCC AGCGGCGCGC AGTACGTCTC GACCGACTAT
CTGTGGGCCG ATCCGCGCTT TCCTGGCGGC TACGTCGTGC GCCTGACCGG CGGCGAGGTC
GCGACCTGTA ACCCGGTCCG CCTCGCCAAC GGCTGCGCGG GGCCGTTCGA GAGCCTGCCG
GGCGCGCCGG TCCAAGGCTA TCTGGCCCCG GCTCAGCGTC CGGACCTGAC CAAGACCCTG
GCCGCCCCGC CGGCCGCCGG CTCGCCGCGC GCCTTGGCCG ACGCGGCCAT CTTCGATCAA
AGCCGAGCCT TGAAGGGCTC CGCGCGCTGG CGGCGGGCGA CCGACGACGT CGATGGCTCG
ACCTACGAGC ACTTCGCCGA GGCCCTGGGC GCGCGCCTGA CCGAGGCCGA TACGCCGATC
CTGACGGCGC TGCTGGAGCG GGCTGGCGAG GATCGCTCGG TGGTCAGCGT CGCCAAGACG
CATTGGGGGA CGAGGCGGCC CTATCTCGAC AAGCCAAACG CGCCGATCTG CGAGGCCAAG
AGCGCGCATC TGGCCGGCAA TCCCGACTAT CCATCGGGAC ACTCGGCCTT TGGCATGCAC
GTGGCCATGA TCCTGGCCGA ACTGGCTCCC AGCCGCGCCG ACGCGCTCTA TTCCCGTGGC
CGCGACTACG CCGAGAGCCG CTGGGTCTGC GGCTCGCACA GCCTCAGCGC CGCCGAGGCG
GGCATGCAGT CGGGCGCGAC GATCTACGCG GCCGAGCATG TCTCGCCGTA CTTCCGTCGC
GATATGGAAG CCGCCCGCGC CGAACTCGAC GCGGCTCTGG CGAAGGCGGT CCCCGCGCCC
CGGCCGTGA
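
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The sketch below shows one way to do that and to compute GC content as a percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input. Pasting the full block in its place should allow a comparison with the reported Gene Length and GC content fields.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCTGAGCC TGCTGCTGGC CCTCGTGGCG GCCGATCCGA TCGCCGCGAC GGTGCGGATG"

seq = clean_sequence(first_line)
print(len(seq))  # 60 bases: six blocks of 10
print(round(gc_percent(seq), 1))
```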
 
Protein sequence
MLSLLLALVA ADPIAATVRM NDLLAVGTHN SYKQAVPPEE MAAMVAATGQ GALALDYGHR 
PLAEELDAGA RQLELDVVRD PEGGRFAKPA TAFGKGVVPT PAWAAAMAKP GYKVLHMQDV
DFRSTCWTFV ACLTQIGTWS KAHPDHAPIL ILLNAKDGPS SFPNGVEALP YDEKAFDALD
GEIRSVFAES QLITPDQVRG GRATLREAVL AGGWPTLNAA RGKVFFALDE SPEKVAAYRG
ARKSLEGRAM FVNTDEASPA AAYLTLNDPI GQRERIAVAV KAGFIVRTRA DADTWQARKN
DVAVRTAALS SGAQYVSTDY LWADPRFPGG YVVRLTGGEV ATCNPVRLAN GCAGPFESLP
GAPVQGYLAP AQRPDLTKTL AAPPAAGSPR ALADAAIFDQ SRALKGSARW RRATDDVDGS
TYEHFAEALG ARLTEADTPI LTALLERAGE DRSVVSVAKT HWGTRRPYLD KPNAPICEAK
SAHLAGNPDY PSGHSAFGMH VAMILAELAP SRADALYSRG RDYAESRWVC GSHSLSAAEA
GMQSGATIYA AEHVSPYFRR DMEAARAELD AALAKAVPAP RP