Gene Caul_0397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0397 
Symbol 
ID5897671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp436316 
End bp437443 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content49% 
IMG OID641560883 
Producthypothetical protein 
Protein accessionYP_001682032 
Protein GI167644369 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000747895 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACG AACTTGAAGA TCATGCGGGG ACTGCGCCTG CCACCATCGA AGAGCAGGAG 
GAGCAGGAGG CCACGACTGA CGATTCCAAC GAGACGCCAC CGTCCGATAT CGTCGCCTAT
AACGAGCTAC GTTCGTGTGC CGACGTATTC CGCATGCATG CCGAAGGCGT TCTTGACATT
CAGCCGGAGT TTCAAAGGGA GTTTGTCTGG AAGGGTATCG CGCAAACTCG CTTTATCGAT
TCCCTCGTCA AACAACTCCC AATACCTTCG ATGTGCTTTG CTTACGACTA CAAGCAAAAT
AGATGGATAG TCATTGATGG CCTTCAACGC ATATCAACAA TTATAAGGTT TCTTGACGGC
GATAAGCGCT GGAGGCTTTC ATCTCTGCCC GATATTGATC AGCACCTTTC TGGAGCTAGC
GTTGCTGACA TTAAAAGTGG CAAGAAACCC GAACTGAAAA ACTTCTACGC TCGCGTCCAA
AACCAGACTC TGCCTGTGAA TGTTCTTCGG TGTGACTTCA AGAAGAAGCG GCACAACGAG
TATCTATTCA CAATTTTTCA TCGCCTGAAT TCGGGCGGGT CAAAGCTAAA CAATCAAGAA
ATTAGAAATT GTATATACTC CGGCCCCTTC AATGACCTCC TCCGAAGCCT CGATAAGCTT
CCGGAGTGGA GAACCATCAA TCATATGAAA GATGACGGCG ATCAGAGATT CATAAAGCAA
GAATGGATTC TCCGACTGTT TGCATTCTTG GAAGATGGAG CAAAATACAA GGGCTCCGTT
TCAAAGTTTC TGAATGACTT CATGTTTGAG CACAGAGATG ATCCAGCGAA AGCGCTGGGC
GCTCGGCGCG ACCTGTTTGA ACGTGTCGTG AAGGTGATGG GTCACAAGAT ATTCGATGAT
AAGCAACCTG ACCGCATGCC TGGCACGGTG CTTGAAGCCA TTATGGTTGG GATTGCGCGC
AATCTAGCCA AGTGCGAAGC CGCCAACGCC GATGATCTAA AAAAGCAATT TCGGACGATG
CTCGATGACG AGAGTATTTC GGACGTCTCG CTCGCCGAAG GACTTTCCAA ACCGGATAAA
GTAACTGCTC GCTTTCAGGC CGCCACCAAA ATTTTCGCGG GCGGATAA
 
Protein sequence
MADELEDHAG TAPATIEEQE EQEATTDDSN ETPPSDIVAY NELRSCADVF RMHAEGVLDI 
QPEFQREFVW KGIAQTRFID SLVKQLPIPS MCFAYDYKQN RWIVIDGLQR ISTIIRFLDG
DKRWRLSSLP DIDQHLSGAS VADIKSGKKP ELKNFYARVQ NQTLPVNVLR CDFKKKRHNE
YLFTIFHRLN SGGSKLNNQE IRNCIYSGPF NDLLRSLDKL PEWRTINHMK DDGDQRFIKQ
EWILRLFAFL EDGAKYKGSV SKFLNDFMFE HRDDPAKALG ARRDLFERVV KVMGHKIFDD
KQPDRMPGTV LEAIMVGIAR NLAKCEAANA DDLKKQFRTM LDDESISDVS LAEGLSKPDK
VTARFQAATK IFAGG