Gene Caul_5252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5252 
Symbol 
ID5897264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp184789 
End bp186120 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content68% 
IMG OID641555355 
Productamino acid permease-associated region 
Protein accessionYP_001676686 
Protein GI167621901 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.38319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGCCGC CGGCTGGGCT GCGCGAGCGC CACATCCGCT TCATTGCGCT GGGCGGGGCC 
ATAGGCGCGG GACTGTTTCT CGGCTCGGGC GCGGCGCTGC ATAGCGCCGG CCCCACCTTG
CTGGCGGCCT ACGCCGCCAG CGGCCTGGCC GTTTTCATGA TCTGCCGCGC CATGGGAGAG
CTGATCCTGG CCCGTCCGTC GCCGGGCGCC TTCGCAGACT ACGCGACCGA CTTCATCGGC
CCCTGGGCGG GCTATTTCAC CGGCTGGTCC TATTGGTTGA TCTGGATGCT CGCCGGCATC
GCCGAGATCA CCGCCGCCGG CGTGTTCATG CGCTTTTGGT TTCCTGACCT GCCGCAATGG
GTCACGGCCC TGTGCGCGGT CGCTGTGCTC GGAGCGGTGA ACCTGACCTC GACGCGACTG
TTTGGCGAAC TCGAGTTCTG GCTGGTGTTG GTCAAGGTTT TGACGGTCAT CGCCCTAATC
CTTGGCGGAG CCTTCATTCT CCTGACCGGA TTTCACCGCC CGCCGCAGGC CGGGCCGGCG
ACCCTGATCG TCGGCGGATT ATTGCCCCAT GGCTGGGGCG GTCTTCTCCA TGCCCTGCCG
ATCGCGATCT TCGGTTTTGG CGGCGTGGAG ATGATCGGCC TGGCCGTTCA GGACGGCGCC
GACCCCCGCC GCTCCGCCCC GAAGGTCATC AACGGGGTCA TCTGGCGAAT TCTGGTCTTC
TACATCGGCG CCCTGGCGGT CATCATGATG ATCTTTCCCT GGACCCAGCT GGATCCGCGC
CAAAGCCCCT TCGTCGCGGT CTTCGCGAGC CTAGGCCTGC CGGCAGCGGC GGGCGTGATC
AACGCCGTAG TCCTCACCGC GGCGCTGTCC AGTTGCAACA GCGGCCTCTA CTCCGCCAGC
CGCATGCTGG CCGCTCTGGC GCGGCAAGGC CAGGCGCCGT CGTCGCTGGC CGCCCGCGCC
GACCATCGGG TTCCCACGCG CGCCGTCCTG GTTTCGATAG CAGGTCTCGG ACTTGGCGTG
GCCCTCAACT ACGCCCTGCC CGACCGCGCG TTCGGCTATC TCGTCAGCGC CCTGGCCGCG
CTAATCCTGT GGATCTGGGG CGTGATCCTG GTATCGCACC TTCGATATCG CCGCCGCCTT
GCCGCCTTGG GCCAAGCGCC CGGCGCCTTC GCCATGCCGG GCGGCGTCGG GGCGAACGTC
GCCACGCTTG GCTTTCTGGT GCTCGTGGCG GCGATCCTGG CGCTCGATCC GGCCAGCCAG
ATGATCTTCG CCATCGCCGC GGGCTGGTTC GCCCTGCTGG CGATCATCTA TCGGCTGACC
AGGCCGCGCT AG
 
Protein sequence
MRPPAGLRER HIRFIALGGA IGAGLFLGSG AALHSAGPTL LAAYAASGLA VFMICRAMGE 
LILARPSPGA FADYATDFIG PWAGYFTGWS YWLIWMLAGI AEITAAGVFM RFWFPDLPQW
VTALCAVAVL GAVNLTSTRL FGELEFWLVL VKVLTVIALI LGGAFILLTG FHRPPQAGPA
TLIVGGLLPH GWGGLLHALP IAIFGFGGVE MIGLAVQDGA DPRRSAPKVI NGVIWRILVF
YIGALAVIMM IFPWTQLDPR QSPFVAVFAS LGLPAAAGVI NAVVLTAALS SCNSGLYSAS
RMLAALARQG QAPSSLAARA DHRVPTRAVL VSIAGLGLGV ALNYALPDRA FGYLVSALAA
LILWIWGVIL VSHLRYRRRL AALGQAPGAF AMPGGVGANV ATLGFLVLVA AILALDPASQ
MIFAIAAGWF ALLAIIYRLT RPR