Gene Caul_5082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5082 
Symbol 
ID5897332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp2722 
End bp3930 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content69% 
IMG OID641555185 
Productaminotransferase class V 
Protein accessionYP_001676516 
Protein GI167621731 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000086974 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0923906 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACAC GCCGCCAATC CCTGACCGCC CTGCTGGCCG CGCCGCTCGC GCCGGGCCTG 
GCGCGCGGCC AGGACCTGGG CGCGGACCTA CTCAGCCGGT CGGCCTTTTC GATGTCGGGT
GTCTATCTCA ACGCCGCCTA CACCCACCCC TTGCCGCGGG CTGGGGCCGC CGCCTTGCGC
GAGGTGGAGG CCGGACGGCT GGATCCAGGG TCCAGGCCAC GCCCGGCCCG GCAGTCCAAG
GCCTTGTTCG CCCAGCTCAT CAACGCCGAC CCCGACGAGA TCGCCTGGAT CCCATCGACC
TCCTACGGCG AGAGCGCGAT CATCTCGGCG ATGGGGCTGA CGGACGCGCC GTCCGGCCGG
GTGGTGACCG ACATCTTGCA CTTCGATGGC GGGCTCTACG CTTATGGCGA ACTGGCCAAG
CGGGGCTTGG ATCTGGTGGT TCTGCCGATG ACGGCTGAAG GCCGGATCGA CATGAACCGC
CTGGAGGCGG CGGTGGCCAA CGGCGCCAAG CTGGTGGCGG TGTCGCTGGT TTCGATGCTC
AACGGTTTCG AGCATGACCT GAAGACCGTG TGCGAGATCG CCCACCGCAA GGGCGCGCTG
GTCTATGCCG ACCTGGTGCA GGCGGCGGGC GCGGTGCCCA TCGACGTCAA GGCCAGCGGC
GTCGACTTCG CCGCCTGTTC CAGCTTCAAG TGGCTGATGG GCGATTTTGG CTGCGGCTTT
CTTTATGTGC GTCGCGACCG CCTGGAGGGC CTGCGCCATA CCCAGTTCGG CTACCACCAA
ATGGCCAACG CCGCCTATCA CGCGTTTCCG CTGGATGCGC CCGGCCCGGC TATGTTCGAG
TACGCGCCCG AGACCGGAAC CGCCAAGGGC CGGTTCGAGG TCGGCTCGAT CAGCTGGGCG
GCCGAGGCGG CCGTCAGCGT CTCCCTCAAG GCCATGATCG CGGTGGGCGT CGATCAGATC
CAGGCGCGCC GCCAGCCGAT GATCGACCGA TTGGCCGCGG CGCTGGGGGC GCGCTATCGG
CCCCTGACGC CGCCGGGATC GCGGTCGGGG ATCCAGGCCT TCAGCCTGGC CAACGCCAGG
AGTCTGCAGC CGCGCTTGCA GGCGGCCAGG ATCAACATCC AGCTCTACGC CAATCGGTTT
CGGGTCTCCC CCTCCGTCTA CAACACGCCC GAGGAGATCG AGGCTCTGAT CGCGGCCCTG
GCCGCCTAG
 
Protein sequence
MPTRRQSLTA LLAAPLAPGL ARGQDLGADL LSRSAFSMSG VYLNAAYTHP LPRAGAAALR 
EVEAGRLDPG SRPRPARQSK ALFAQLINAD PDEIAWIPST SYGESAIISA MGLTDAPSGR
VVTDILHFDG GLYAYGELAK RGLDLVVLPM TAEGRIDMNR LEAAVANGAK LVAVSLVSML
NGFEHDLKTV CEIAHRKGAL VYADLVQAAG AVPIDVKASG VDFAACSSFK WLMGDFGCGF
LYVRRDRLEG LRHTQFGYHQ MANAAYHAFP LDAPGPAMFE YAPETGTAKG RFEVGSISWA
AEAAVSVSLK AMIAVGVDQI QARRQPMIDR LAAALGARYR PLTPPGSRSG IQAFSLANAR
SLQPRLQAAR INIQLYANRF RVSPSVYNTP EEIEALIAAL AA