Gene Caul_4973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4973 
Symbol 
ID5902435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5377127 
End bp5378287 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content63% 
IMG OID641565494 
Producthypothetical protein 
Protein accessionYP_001686591 
Protein GI167648928 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.351891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGCA GCTTCGACAC CCTGAACGAG ATGGATCCCG CGACGCGGGC CTTTGCTGAG 
GTGCGCGCAC GCGAAGCCGG AATGAGCCTC GCCGAATGGC TGGGGGCTTT AGTCCACGAA
CGCGCCGCGC CGCAGGTGCC GCGTCGCTTT TACGGTGGCG GAGGACCAAT CTGGGCTGAA
GGCCCGGGGG CTGGTCGTTC GCTGGCGATA TCGATGGGCG CTCGACCAGC CTCCGCCTAC
GACATGGACG TCGCGCACTA CATGCCGATC CTGAGATACC CAGAGATCGA GAGGGGCGAT
CTCAATCGTA TTTTTCTCGA CCACGAGGTT CATGAGGTGA CGCGGTCGGC CCTTCACATG
ATCGCATTGT TTCCGCCACG GATCAGCCAT GCGGCAATCC TCCTTAGTGC GGCGTTTGCA
GAAGCAGACG TGCAGATCCG GTTGCCGCCG TACCGCCCGA GCGGCCACCA CAAAGCCTCA
GGCCGTTGGT ATCACACGCT CGTCGATACC TGCCTGCATC GCTCGCGGGA GGTTCGGAAT
GCCTTGATGC ATGAATGGAC GCCCAGTGGG CGAGCTGAGC TCATGTGGGG CGACGCCTTC
GCCACCCTTT GCGACCTCAC GTCGGCGCGC CAAGAACTCC TGGTCATGTT TCACCACGAT
CGGAGCCGGG CGTCTGTCGA ACTCGCCAAA CACCTCGACG TTGCGACCCA GCAGTTCGAG
TTGTCCGCCA ATCTTTTGGC GGAGGTCGAT ACGAGCGCGG CAGAGCCGAC TGAACAGCAG
CGCTTCACTC GGCTGCGTGA GGCTTTGCTA GAGCGGGCTG GCGGCGGCGT GTCCCTGACC
GAAGGCGCAA AGCTTCTCAA TGTCTCACGC CAGGCGCTCC ACAAGCGGAT CAAGGCCGGT
ACGGCGCTAG GCATGATGGA TGGCGACGAG CTCGTCTTGC CGCGTCTGCA GTGGGTCACA
AAGGGGGACG AAGTCCAGTT TCTCCCTGGA CTGACCGATG TGGTGAGGCA GTTCGAGCGC
GCGGGCGGAT GGTCGGCGCT GCAGTTCCTC CTCGATCACG ATCCCAATCT CGCAAAGCCG
CCGATCCAGG CGCTGCGCGA AGGCTCTCCT GAGAAGGTGG TTGCGGCCGC GCGCGCCTAC
CTTGGCCTGG ACGAGGAATG A
 
Protein sequence
MMSSFDTLNE MDPATRAFAE VRAREAGMSL AEWLGALVHE RAAPQVPRRF YGGGGPIWAE 
GPGAGRSLAI SMGARPASAY DMDVAHYMPI LRYPEIERGD LNRIFLDHEV HEVTRSALHM
IALFPPRISH AAILLSAAFA EADVQIRLPP YRPSGHHKAS GRWYHTLVDT CLHRSREVRN
ALMHEWTPSG RAELMWGDAF ATLCDLTSAR QELLVMFHHD RSRASVELAK HLDVATQQFE
LSANLLAEVD TSAAEPTEQQ RFTRLREALL ERAGGGVSLT EGAKLLNVSR QALHKRIKAG
TALGMMDGDE LVLPRLQWVT KGDEVQFLPG LTDVVRQFER AGGWSALQFL LDHDPNLAKP
PIQALREGSP EKVVAAARAY LGLDEE