Gene Caul_1589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1589 
Symbol 
ID5899044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1676272 
End bp1677891 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content67% 
IMG OID641562077 
Productamino acid permease-associated region 
Protein accessionYP_001683217 
Protein GI167645554 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.984032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGCGCG TTAAGTCACT GGATGCGATT CTAGCCACGG CCAAGAAGAA GTCGTTGCAT 
CGCTCGCTCG GCCCCATTCA GCTCACGCTT CTGGGCGTCG GCGCCATCAT CGGCACCGGC
ATCTTCGTCC TGACCGCCGC CGCGGCCCAG AAGGCGGGCC CGGGCATGAT GTGGAGCTTC
GTGATCGCCG GCGCCGTCTG CGCCATCGCC GCGCTCTGCT ATTCGGAACT GGCCTCGATG
TTGCCCGTGT CGGGCTCGGC CTACACCTAC ACCTACGCGG TGATGGGCGA ATTGCTGGCC
TGGATGGTCG GCTGGGCGCT GATCCTCGAA TACGCCGTGG CCGCCAGCGC CGTGTCGGTG
GGGTGGTCGG GCTATTTCCT CGGACTCATA GAAAACGCCC TGCACTTCCA CTGGCCTGAC
GCCCTGCGCG CCGGTCCAGC CTGGTCGATG AACGGCTTCA TGCCGGTGGC CGACTTCAGC
GCTGGCGTCG TCAACATCCC GGCCATCCTG GTGGCCCTGA CCGTGACCGC CCTGCTGGTT
CGCGGCACGA CCGAAAGCGC CCGCGTCAAC GCCGTGCTCG TCGTGATCAA GGTGACGGCC
CTGACCGCCT TCGTCATCCT GACCATTCCG GTGATCAAGA CCGGCAACTT CTCGCCCTTC
ACCCCCAACG GCTGGTTCGG CCCGCACGGA ACGACGGGCA TGGGCGTGGT CGGCGCCGCC
GCTTCGATCT TCTTCGCCTA TGTCGGCTTC GACGCGGTCT CCACCGCCGC CGAAGAGACC
AAGAACCCCC AACGTAACGT GCCGATCGGC CTGATCGGCA GCCTGGGCAT CTGCACCATC
TTCTACCTGT TGGTCGCGGC CGGCGCGGTT GGCGCCATCG GCGCTCAGCC CGTCATCGGC
GCGGCCGGTG AAGCCGTGCA ACCCGGCTCG GCCGCCTTCC AAGCGGCCTG CGGCCTGGCT
TCGAACGCCG ACCGCCTGGT CTGCTCGAAC GAGGCCCTCG CCCACGTCCT GCGCAAGATC
AACTTCCCGG TGGTCGGCAA CCTGCTGGGC CTGGCCGCCA ACCTGGCCCT GCCCTCGGTC
ATTCTGATGA TGATCTACGG CCAGACCCGC ATCTTCTTCG TCATGGCCCG CGACGGCCTG
CTGCCGGAGA AACTGGCCGC CATCCACCCC AAGTGGAAGA CGCCGCACAT CGTCACCATC
GCCACCGGCA TCTTCGTGGC CATCGCCGCC GCCTTCTTCC CGGTGGGCCA ACTGGCCGAC
ATCTCCAACT CCGGCACGCT GTTCGCCTTC TTCATGGTGG CGATCGCGGT GATGGTGCTG
CGCTACAAGG ATCCGAACCG TCCGCGTCCG TTCAAGACCC CGGCGATCTG GATCGTGGCG
CCGGTGGCGA TGATCGGTTG CGCGGGCCTG TACTTCAACC TGCCGCTGGA ATCGATGCTG
GTGCTGCCGA TCTGGGGCGC TCTGGGCCTG GTGATCTATT TCGCCTACGG GTTCCGCAAG
AGCCACGTCG GCCGCGGCAT CACCGACGAG GTCCACGAAC TGGACCCCGA CGCGCCGGCG
ACGGGCGTGG CCCCGATGCC GGGCGCTCCG GCCCCGGGCT CGCCCGACGA ACGGTCTTGA
 
Protein sequence
MWRVKSLDAI LATAKKKSLH RSLGPIQLTL LGVGAIIGTG IFVLTAAAAQ KAGPGMMWSF 
VIAGAVCAIA ALCYSELASM LPVSGSAYTY TYAVMGELLA WMVGWALILE YAVAASAVSV
GWSGYFLGLI ENALHFHWPD ALRAGPAWSM NGFMPVADFS AGVVNIPAIL VALTVTALLV
RGTTESARVN AVLVVIKVTA LTAFVILTIP VIKTGNFSPF TPNGWFGPHG TTGMGVVGAA
ASIFFAYVGF DAVSTAAEET KNPQRNVPIG LIGSLGICTI FYLLVAAGAV GAIGAQPVIG
AAGEAVQPGS AAFQAACGLA SNADRLVCSN EALAHVLRKI NFPVVGNLLG LAANLALPSV
ILMMIYGQTR IFFVMARDGL LPEKLAAIHP KWKTPHIVTI ATGIFVAIAA AFFPVGQLAD
ISNSGTLFAF FMVAIAVMVL RYKDPNRPRP FKTPAIWIVA PVAMIGCAGL YFNLPLESML
VLPIWGALGL VIYFAYGFRK SHVGRGITDE VHELDPDAPA TGVAPMPGAP APGSPDERS