Gene Caul_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4454 
Symbol 
ID5901915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4821797 
End bp4822906 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content66% 
IMG OID641564973 
ProductABC transporter periplasmic binding protein, urea carboxylase region 
Protein accessionYP_001686072 
Protein GI167648409 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR03427] ABC transporter periplasmic binding protein, urea carboxylase region 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCT GGTTCAACGG TATGCGTATC GCCGCGGCTG TCGCCGCATG CGGGCTGGCG 
CTCAGCGCCT GCGGTCCCAA GACCGAAACC AAGACGGCCG CCGCGCCGGC GCCCGCCGCC
GTCAAGACCG ACTACAGGAT CGGCTGGACG ATCTATGCCG GCTGGATGCC CTGGGCCTAC
GCCCAGCAGT CGGGCATCGT GAAGAAATGG GCCGACAAGT ACGGCGTGCA GATCGAGCTG
GTGCAGATCA ACGACTATGT CGAGTCGCTG AACCAGTTCT CGGCCGGCAA GCTGGACGGC
GTCACCGCCA CCAACATGGA CGCCCTGACC GTGCCGGCCG CCGCCGGAAA GGACACCACG
GTCCTGATGA TCGGCGACTA TTCCAACGGC AATGACGGCG TGATCCTCAA GAACGGCGAG
ACCCTGGCCG ACATCAAGGG CCGGCCGGTC AACCTGGTCG AGCTGTCGGT CTCGCACTAC
CTGTTGGCCC GCGCGCTGGA AAAGGCCGGG CTGAAGATGG CCGACGTCAA GACGGTCAAC
ACCTCCGACG CCGACATCGT CGCCGCCTAT GGCGCCGCCG ACACCAAGGC CCTGGTCACC
TGGAACCCGC AGCTGTCGGA AGTGAAAAAG ATGCCGGGCG CGAGCCTGGT GTTCGACAGC
TCCAAGATCC CCGGCGAGAT CCTCGACGGC CTGATGGTCA GCACCGACGC GCTGAAGGCC
AATCCCAACC TCGGCAAGGC CCTGACCGGC ATCTGGTACG AAACCATGGC CCTGACCGTC
GCCCAGACCC CGGAAGGCAA GGCCGCGCGC GAGGCGATGG CCAAGCTGTC GGGCGCCGAC
CTGGCCAGCT TCGAGAGCCA GTTGAAGACG ACCTACCTCT ACGCCGACCC CACGGCCGCC
CTGGCCGCGA CGGTCAGCCC CGACCTAGTC ACGGCCAACG ACCGGGTGCG CAAGTTCAGC
TTCAGCATGG GCCTGTTCGG CCAAGGCGCG AAGTCGGTGG ACGACATCGG CATCAGCTTC
CCGGGCGGCA AGACGCTGGG CGACCCGGCC AATGTGAAGC TGCGCTTCGA TCCGACCTAT
GTGCAGCAGG CGGCGGACGG CAAGCTGTAG
 
Protein sequence
MKTWFNGMRI AAAVAACGLA LSACGPKTET KTAAAPAPAA VKTDYRIGWT IYAGWMPWAY 
AQQSGIVKKW ADKYGVQIEL VQINDYVESL NQFSAGKLDG VTATNMDALT VPAAAGKDTT
VLMIGDYSNG NDGVILKNGE TLADIKGRPV NLVELSVSHY LLARALEKAG LKMADVKTVN
TSDADIVAAY GAADTKALVT WNPQLSEVKK MPGASLVFDS SKIPGEILDG LMVSTDALKA
NPNLGKALTG IWYETMALTV AQTPEGKAAR EAMAKLSGAD LASFESQLKT TYLYADPTAA
LAATVSPDLV TANDRVRKFS FSMGLFGQGA KSVDDIGISF PGGKTLGDPA NVKLRFDPTY
VQQAADGKL