Gene Caul_3098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3098 
Symbol 
ID5900553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3361653 
End bp3362771 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content70% 
IMG OID641563601 
Productacyltransferase 3 
Protein accessionYP_001684723 
Protein GI167647060 
COG category[I] Lipid transport and metabolism 
COG ID[COG1835] Predicted acyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAA CCGCCTCGAT GTCTCAGAAC GGCTCGCCGG GCCTGTCCCG TGGCGGCGCC 
CTGGATCTCC TGCGGTTCGT CGCGGCGTTG TTCATCGTGC TCTATCATGT GGCCGAGCGG
GCCCCGGTGT CGCTGTTCGC GATCCACCCG GCCTTCGGGC GCGGCTATCT GGCCACCGAT
TTCTTCCTGA TGCTGTCGGG CTATGTGCTG GCCAGGACCT ACGGGTCCCG CGTCCTTGAC
CAGGGCGTCA GGACCGGCGA TTTCCTCAAG CGTCGCCTGC TGCGCATCTG GCCCGCCCAC
CTGGTGATGC TGGCCCTGTT CGTGGTCTTC GTGCTGGCCA CCGCCGCCAT CGGCCTGGCC
CCGCAGAACC CGCAATGGTT CCAGTGGAGC CAGCTGCTGC CCCAGGTCTT CCTGATGCAG
GCCTGGTTCG TGCCCGGCCC GTCGGGCTGG AACATGCCGA CCTGGACACT CTCGGCCCTG
ATCGTCTGCT ATGGCGGCTT CCCCGCCGCC TGGCGGCTGA CCGCCAAGGT GCGCTCGCCT
TGGACCACCC TGGCGATCGG CGTCGTGATC TTCCTGGTCG TCGACGCCGC CGCCAAGGCC
GTCACCGGCA TACCGGCCCA CCAGCTGCCG CTGCGCTTTG GCCTGGTGCG CGGAATCCCG
CTGTTCATCC TGGGCATGCT GATCGCCCGC CTGCCGACGA CCCTCGCCCC TCGCCTGGCC
GACGGTCTGG CGATCGCGGC GGGCGTCGGC GTGGTGGCCC TACAGGTCGT CGGCCGGTTC
GACCACGCCA GCCTGGCCCT GCTGGGCCTG CTGATCTACG CCGCCGGCGC CTCGGGCGCG
AAGGGCTGGG GCTGGGCCAG CCTGGCCGGC CGGCTGTCGT TCTCGCTGTT CCTCACCAAC
CAACTGGTCG CCGTGGTCTG GTTCGGCCTG CTGCGCGCGG TCGCCGGCAA GCTGGGCTTC
GACGACCCCT TGCTGTGGCT GACCTGGGCC ATGGCCCTCC CCGCCTGCGT GATCGCCGCC
TGGCTGTTCG AGCGCTTCGT CGACGCGCCG CTGCAGGTGT GGATCAAGGG GTGGTCGCGG
CGCGAGCCGG CGACCAAGGC CGAGCCGGCG CTGGCTTAA
 
Protein sequence
MSQTASMSQN GSPGLSRGGA LDLLRFVAAL FIVLYHVAER APVSLFAIHP AFGRGYLATD 
FFLMLSGYVL ARTYGSRVLD QGVRTGDFLK RRLLRIWPAH LVMLALFVVF VLATAAIGLA
PQNPQWFQWS QLLPQVFLMQ AWFVPGPSGW NMPTWTLSAL IVCYGGFPAA WRLTAKVRSP
WTTLAIGVVI FLVVDAAAKA VTGIPAHQLP LRFGLVRGIP LFILGMLIAR LPTTLAPRLA
DGLAIAAGVG VVALQVVGRF DHASLALLGL LIYAAGASGA KGWGWASLAG RLSFSLFLTN
QLVAVVWFGL LRAVAGKLGF DDPLLWLTWA MALPACVIAA WLFERFVDAP LQVWIKGWSR
REPATKAEPA LA