Gene Spro_4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_4068 
Symbol 
ID5604776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp4508690 
End bp4509931 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content46% 
IMG OID640939629 
Producthypothetical protein 
Protein accessionYP_001480291 
Protein GI157372302 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTTA TTGCTGTTCT TGAGTCAAAC CTGGCGGGCA ATGGTGCCTT AGTCATTCAG 
GCGGCCAAAC TAAAGGGCTA TAAAACGCTT TTCGTCTGTG GCAATAAAGC GGAATACGCC
AATGCCGTGA TTAATCCGAT TGATTTTGCT GATGAAGTGG CCGTTGTTGA CTCGTATGAC
ATTGCCAAGC TGCTGGCCTT CTTCCAGAGT TACCCGGATA CGGTTGACGC AGTCATGGCA
TTTGATGACT TTAGAATGAT ACAGGCCGCG ATCATTAACC AGTTTCTTAA CCTGCCGTAT
GCGCCAGCTG TTGAGGCATT ATTGACCGTG CGATTTAAAC ATTTGTTAAG AGAAAAACTC
AACAATACTG CGTATGCGAT TGACTTTACA CGTTTGGCGG GTGACAGCAC AGCCCATCAA
GCGTTATTGA AATACCCGTG TGTGATCAAG CCGGTAGATG AAAGTGGCAG CATAGGTGTC
AAGGTCTGTC GTTCAAAATC GGAGTCGGAC GAGGCTATCG ACTATATTCT GTCTCTGCCA
AAATATAATG GTCGCGGGTT TACCGTTTCA AAAGACATCC TGGTTGAAGA GTTTATTACC
GGGGAAGAAT ACAGTGCCGA GTTGGTTTGG GATTGTGAAA ATGAGGATTG GAGACTGCTT
GGGGTTACTC AAAAATTTGT TACCCCTCCC CCGTTCTGCG TAGAAAAAGC CCATATTTTT
CCGTATGCAG ACGGCAGTGA ATTTTCAGAG CAGGTGAGCC TGCATGCCAA CCGTATACTT
GAATATGTTG GTCTGCGAAA TACGTTTGTA CATATGGAAT TTAAATTCTC CGATGGGTGC
TTTAATGTCA TTGAAATCAA TCCGCGCTTG GGGGGAGACA TGATTATCGA ACTCATGAAA
AATGCGAAAG GGTATGATGC TGCTGGTCTG ATGTTGGAAG CGAATATCAA TCAACCTTTG
AGCATAGAAA ACACCGGCGT ACAGGGGGAT GCTTCGGCTA TCGTGTATAT CACAGATAGT
CGCGGTGGGC ATATAACCGA TATTACCGTC GAGAGTGATG AGGAAATTTT CTCCCGGGTC
AATATTTTTT CGCTCCCAAA AACACTGAAA GGGTTGCGCA GCAGCGAAGA CCGGTTGGGG
TATGTCATTT TAAATAAAGA CCGTTATCAG CAGATTGAGC TGCGTGTTAC CGCTCTGCTC
GATGGTAATG GTTTCAATAT TACCCAGTCG GCATTGTCCT GA
 
Protein sequence
MSVIAVLESN LAGNGALVIQ AAKLKGYKTL FVCGNKAEYA NAVINPIDFA DEVAVVDSYD 
IAKLLAFFQS YPDTVDAVMA FDDFRMIQAA IINQFLNLPY APAVEALLTV RFKHLLREKL
NNTAYAIDFT RLAGDSTAHQ ALLKYPCVIK PVDESGSIGV KVCRSKSESD EAIDYILSLP
KYNGRGFTVS KDILVEEFIT GEEYSAELVW DCENEDWRLL GVTQKFVTPP PFCVEKAHIF
PYADGSEFSE QVSLHANRIL EYVGLRNTFV HMEFKFSDGC FNVIEINPRL GGDMIIELMK
NAKGYDAAGL MLEANINQPL SIENTGVQGD ASAIVYITDS RGGHITDITV ESDEEIFSRV
NIFSLPKTLK GLRSSEDRLG YVILNKDRYQ QIELRVTALL DGNGFNITQS ALS