Gene Caul_0478 details


Gene Information

Locus tag: Caul_0478
Symbol:
ID: 5897933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 518674
End bp: 519879
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 67%
IMG OID: 641560961
Product: hypothetical protein
Protein accession: YP_001682110
Protein GI: 167644447
COG category:
COG ID:
TIGRFAM ID:
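
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(518674, 519879)
print(length, expected_protein_length_aa(length))  # prints: 1206 401
```

Under these assumptions the computed values agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields.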


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGGT TTCAATCCGG ACACGCCCGC GCCGCCGATC CGTCTGTCGC CGGGGGCTAC 
AAGGTCGACA TCTCGCGCGG GGAGCGCATC GGCCGCGTCT CGTCGGAATG GTTCTCTCGA
CCCGACGACG AGCGCTACCT TTCGCTCGGC GCGCTCTACG CCGCTGTTCA CGCCCGCGCC
GAGCACGCCA CCTCCCGCAC GGTCGAGACC CGCCGTCTCC GTGTCGAGGC CGATCGCGAC
GGCGCCGCGC GCCTGGCCCT GATCATGCCG GGTCGTGACG AGCCCGTCGC CCCGACTCAC
TGGTCCTTCG GCCAGTTGTG CGGTCTGGTT GGCGCGCCGG CTGGCTACCT TCGCGATCTG
CCCGCCCCCT TGGCCGGCAT CAACCTGCAG CACGGCTTGC TCTCGCATCG CGCTGAACTG
ATCAAGACCC TTGAGACCGA CGACGGCCGC GTCGAACTGC GCGCCGTCAC CGGTCCCGAC
TATGGGCGGA TCTGGGACCA TGAACTGGTC GCGGCGGTGA TGAAGATCGC CGGCGACGGC
ACCGGCGACA CGCGCTGGAA GGTGCCCGGC CTGCTGGACT GGTCGATGAT GACCCACAAT
CCGTTCGTCG AGGTCACCAA GGACACCACC ACCCTCTATG CCAGCGATCG CGACGTCTTC
CTGTTCCTGG TCGATGACGC CCACCCGATC GAGGCGGGCC GCCTGCCGAA CGGCGAGCCA
GATCTTTATT TCCGCGGCTT TTATTGCTGG AATAGCGAGG TCGGCTCCAA GACCCTGGGC
ATGGCCTCCT TCTATCTCCG CGCTGTCTGC ATGAACCGCA ACATCTGGGG CGCCGAGGGC
TTCCAGGAGA TCAGCATCCG CCACAGCAAG TTCGCCGCCC GGCGCTTCGT TCACGAGGCC
GCGCCGGCGC TGGAGCGCTT CGCCAACGCC TCGACCACAC CCTTCATCAA CGGAATACGC
GCCGCGCGCG AGACCATCGT CGCCCGCAAG GACGACGATC GCGAGACTTT CCTGCGCAAG
CGCGGCTTCT CAAAGACCGA GACCGGCAGG ATCATCGCCA CGGTTCTGAA CGAGGAGGGT
CGGCCCCCGG AATCGATCTT CGATTTTGTG CAAGGCATCA CGGCGGTCGC CCGGGACAAG
CCCCAGCAGG ATGCCCGTCT GGAGCTGGAG GCAAAGGCCG GCCGATTGCT GGCCAGCGTC
CGCTAG
 
Protein sequence
MMRFQSGHAR AADPSVAGGY KVDISRGERI GRVSSEWFSR PDDERYLSLG ALYAAVHARA 
EHATSRTVET RRLRVEADRD GAARLALIMP GRDEPVAPTH WSFGQLCGLV GAPAGYLRDL
PAPLAGINLQ HGLLSHRAEL IKTLETDDGR VELRAVTGPD YGRIWDHELV AAVMKIAGDG
TGDTRWKVPG LLDWSMMTHN PFVEVTKDTT TLYASDRDVF LFLVDDAHPI EAGRLPNGEP
DLYFRGFYCW NSEVGSKTLG MASFYLRAVC MNRNIWGAEG FQEISIRHSK FAARRFVHEA
APALERFANA STTPFINGIR AARETIVARK DDDRETFLRK RGFSKTETGR IIATVLNEEG
RPPESIFDFV QGITAVARDK PQQDARLELE AKAGRLLASV R
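
The sequences above are laid out in blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, not an official tool: it strips the spacing, computes the GC fraction, and translates a coding sequence using the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in which codons are permitted as start codons). The helper names are hypothetical, and the sketch assumes the input starts in frame and contains only A, C, G and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
start = clean_sequence("ATGATGCGGT TTCAATCCGG ACACGCCCGC")
print(translate(start))  # prints: MMRFQSGHAR
```

Passing the complete gene sequence through clean_sequence and translate should yield a string that can be compared against the protein sequence listed in the record, and gc_fraction can be compared against the GC content field.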