Gene Caul_2950 details

Gene Information

Locus tag: Caul_2950
Symbol:
ID: 5900405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3200527
End bp: 3201867
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 69%
IMG OID: 641563447
Product: amino acid permease-associated region
Protein accession: YP_001684575
Protein GI: 167646912
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
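
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are both inclusive positions on the replicon and that the CDS is complete and ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3200527
end_bp = 3201867
gene_length_bp = 1341
protein_length_aa = 446


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

A length of 1341 bp corresponds to 447 codons, the last of which is the stop codon, giving the 446 aa listed.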


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.534787
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGCC ACGCCGACAT CCTGTTTTAC GTCAGTGTCG GGGCCGGCAT GGCCCTGGCG 
ACCAGCGTGT TCACGATGAT CGGCGGCCTG TTCGCGGTGG CCAGCCTGCC GTGGATCGTG
ACGGGCGTCG TGCTGGCGGG ACTGTTCTGC GGCGCGATCT CGCTGTCGAT CGGCGAGCTG
GCCAGCCTCT ATCCCTCGGC GCCCGGCATC CGCACCTATT TCAAGGCGGC GTTCGGCGAG
TTCCCGTCGC TGGTGGCCAT CTATCTGTAC CTGGCCTTCG CGATCATCGT CGCGGGCCTC
GAAAGCTTCG TGTTCGCCAG CGTTGTCGGG ATGGTGGCGC CAGACTGGCC CAGGGAGGCC
ACGGTCCTGG TGCTGCTGCT GGTGGTGGTC GGCACCAACC TGGCCGGCTT CCAACTGCCG
CGCGGCCTGC AGATCGGCTC GACCGTCGGG GCGGTGGGCC TGGTGCTGGT CGCGGCGGTC
TGGGCGCTGG CGCGGAACGG ATCGGCGCCG CATCCGCTCT CCCCGCCCCT CGCCGGATCG
CTCGCCCAGC TGCCGGCCCT GGTCGGGATG AGCATTTTCC TGTTCACCGG CTTTGAATGG
GTGACGCCGC TGGGCCTCAA GCCCTCGGCC TACAAGATGC AGATCCCGGT CTCGATGCTG
CTGGGGCTGC TGGTGCTGAC CCTGACCTAT GTGCTGTTCA CCCTGGGCGC GGCGGCCCAG
GTCCCGGCGA CCTCCCTGGC CGGCGCCCTG GCGCCGCAGG TGGCGCTGTT TCGGCAGATC
TATGGCGAAG TCGGGCTCTA TGTCGGCCTG GCACTGTCGG TGCTGGCGAT CTTCTCGACC
TTCAACGCCG GCATTCTCGG TGGCGCGCAG TTGATCTACC TGCTCGGCCG CGAAGGTGCC
CTGCCCCCGT GGCTGGCGGT GATGTCGCCG CGCACCGCGA CGCCGACCGG GGCGATCCTG
CTGCTCGGAT CGCTGGCCAG CGTCTCGGCG ATCATCGTCC TGACCTTCAG GCTGGAGATC
ACCGCCGCCC TGGTCGGCGC CACCATCATG TGCGCGGTCT ATAGCGGCTT CGTCGCCTGC
GGCCTGCGGC TGAAGACCAG GCCCGCCGCG CCGGGTCGCC GGTTCACCAA CCCGCTGCCG
GCCTGGGCGC AGATCCTGCT GGTCCCAGTG TTGCTGATCG TCGGCGTCCA GACCCTGTTC
TCGGAGCCCA AGACCACGGT CTCCGCCCTG GTCGGCCTCG CCGTGGTGCT GGCCATCGCC
TGCCTGCTGG CGACCTATTC GACCTCGCTG CGCGCTGGCG AGCGCCGGGC CGCCACGGCC
ATGCCCCGGA GGGTCGAATG A
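
The GC content field can be recomputed from the gene sequence as printed above, in space-separated blocks of ten bases. This is a minimal sketch that assumes GC content means the percentage of G and C among the A, C, G and T bases. The helper name `gc_percent` is hypothetical, and the way the database rounds the value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the sequence
    can be pasted in the 10-base blocks used in this record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence: 12 of the 20 bases are G or C.
first_blocks = "ATGAAGCGCC ACGCCGACAT"
print(gc_percent(first_blocks))
```

Passing the full gene sequence to the same function gives a value that can be compared with the 69% listed in the record.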
 
Protein sequence
MKRHADILFY VSVGAGMALA TSVFTMIGGL FAVASLPWIV TGVVLAGLFC GAISLSIGEL 
ASLYPSAPGI RTYFKAAFGE FPSLVAIYLY LAFAIIVAGL ESFVFASVVG MVAPDWPREA
TVLVLLLVVV GTNLAGFQLP RGLQIGSTVG AVGLVLVAAV WALARNGSAP HPLSPPLAGS
LAQLPALVGM SIFLFTGFEW VTPLGLKPSA YKMQIPVSML LGLLVLTLTY VLFTLGAAAQ
VPATSLAGAL APQVALFRQI YGEVGLYVGL ALSVLAIFST FNAGILGGAQ LIYLLGREGA
LPPWLAVMSP RTATPTGAIL LLGSLASVSA IIVLTFRLEI TAALVGATIM CAVYSGFVAC
GLRLKTRPAA PGRRFTNPLP AWAQILLVPV LLIVGVQTLF SEPKTTVSAL VGLAVVLAIA
CLLATYSTSL RAGERRAATA MPRRVE
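
The protein sequence can be checked against the gene sequence by translating the first and last codons. The sketch below builds the codon table from the usual T, C, A, G ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so this mapping is assumed to be sufficient here. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon; '*' marks a stop codon.

    Whitespace is removed first, and a trailing partial codon is ignored.
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and final line of the gene sequence in this record.
print(translate("ATGAAGCGCC ACGCCGACAT CCTGTTTTAC"))  # MKRHADILFY
print(translate("ATGCCCCGGA GGGTCGAATG A"))           # MPRRVE*
```

The two results match the start (MKRHADILFY) and end (MPRRVE) of the protein sequence, with the final TGA codon acting as the stop.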