Gene Caul_0601 details

Gene Information

Locus tag: Caul_0601
Symbol:
ID: 5898056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 663118
End bp: 664245
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 49%
IMG OID: 641561083
Product: hypothetical protein
Protein accession: YP_001682232
Protein GI: 167644569
COG category:
COG ID:
TIGRFAM ID:
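
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the CDS ends in a single stop codon. The function names are hypothetical and used only for illustration.

```python
# Sketch: consistency check for the coordinates in this record.
# Assumption: Start bp and End bp are 1-based and inclusive.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(663118, 664245)
print(length)                  # 1128
print(protein_length(length))  # 375
```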


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000252439
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGACG AACTTGAAGA TCATGCGGGG ACTGCGCCTG CCACCATCGA AGAGCAGGAG 
GAGCAGGAGG CCACGACTGA CGATTCCAAC GAGACGCCAC CGTCCGATAT CGTCGCCTAT
AACGAGCTAC GTTCGTGTGC CGACGTATTC CGCATGCATG CCGAAGGCGT TCTTGACATT
CAGCCGGAGT TTCAAAGGGA GTTTGTCTGG AAGGGTATCG CGCAAACTCG CTTTATCGAT
TCCCTCGTCA AACAACTCCC AATACCTTCG ATGTGCTTTG CTTACGACTA CAAGCAAAAT
AGATGGATAG TCATTGATGG CCTTCAACGC ATATCAACAA TTATAAGGTT TCTTGACGGC
GATAAGCGCT GGAGGCTTTC ATCTCTGCCC GATATTGATC AGCACCTTTC TGGAGCTAGC
GTTGCTGACA TTAAAAGTGG CAAGAAACCC GAACTGAAAA ACTTCTACGC TCGCGTCCAA
AACCAGACTC TGCCTGTGAA TGTTCTTCGG TGTGACTTCA AGAAGAAGCG GCACAACGAG
TATCTATTCA CAATTTTTCA TCGCCTGAAT TCGGGCGGGT CAAAGCTAAA CAATCAAGAA
ATTAGAAATT GTATATACTC CGGCCCCTTC AATGACCTCC TCCGAAGCCT CGATAAGCTT
CCGGAGTGGA GAACCATCAA TCATATGAAA GATGACGGCG ATCAGAGATT CATAAAGCAA
GAATGGATTC TCCGACTGTT TGCATTCTTG GAAGATGGAG CAAAATACAA GGGCTCCGTT
TCAAAGTTTC TGAATGACTT CATGTTTGAG CACAGAGATG ATCCAGCGAA AGCGCTGGGC
GCTCGGCGCG ACCTGTTTGA ACGTGTCGTG AAGGTGATGG GTCACAAGAT ATTCGATGAT
AAGCAACCTG ACCGCATGCC TGGCACGGTG CTTGAAGCCA TTATGGTTGG GATTGCGCGC
AATCTAGCCA AGTGCGAAGC CGCCAACGCC GATGATCTAA AAAAGCAATT TCGGACGATG
CTCGATGACG AGAGTATTTC GGACGTCTCG CTCGCCGAAG GACTTTCCAA ACCGGATAAA
GTAACTGCTC GCTTTCAGGC CGCCACCAAA ATTTTCGCGG GCGGATAA
 
Protein sequence
MADELEDHAG TAPATIEEQE EQEATTDDSN ETPPSDIVAY NELRSCADVF RMHAEGVLDI 
QPEFQREFVW KGIAQTRFID SLVKQLPIPS MCFAYDYKQN RWIVIDGLQR ISTIIRFLDG
DKRWRLSSLP DIDQHLSGAS VADIKSGKKP ELKNFYARVQ NQTLPVNVLR CDFKKKRHNE
YLFTIFHRLN SGGSKLNNQE IRNCIYSGPF NDLLRSLDKL PEWRTINHMK DDGDQRFIKQ
EWILRLFAFL EDGAKYKGSV SKFLNDFMFE HRDDPAKALG ARRDLFERVV KVMGHKIFDD
KQPDRMPGTV LEAIMVGIAR NLAKCEAANA DDLKKQFRTM LDDESISDVS LAEGLSKPDK
VTARFQAATK IFAGG
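
The gene and protein sequences above are printed in space-separated blocks of 10. The following sketch shows one way to strip that layout, translate the nucleotide sequence, and compute a GC fraction. It is not the database's own method. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code for every codon; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
# Sketch: clean a block-formatted sequence, translate it, and compute GC.
# Assumption: table 11 amino acid assignments (same as the standard code).

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon table in NCBI order (first base varies slowest).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_blocks = "ATGGCTGACG AACTTGAAGA TCATGCGGGG"
print(translate(first_blocks))  # MADELEDHAG
```

Passing the full gene sequence to translate() and comparing the result with the cleaned protein sequence is a quick way to confirm that the two listings agree.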