Gene Sare_2144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2144 
Symbol 
ID5707270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2465958 
End bp2467208 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID641271629 
Productphosphoribosylglycinamide synthetase 
Protein accessionYP_001537000 
Protein GI159037747 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0250211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACCG CACCGACCGA TCACCGTCCC CGGCTGTTGC TGGTGGCCAG CGGCATGCGC 
CCGTACCGGG AGTACCTGCT CGCGTCGATC GCCACCGCCT ACCGCGTCCA CCTGTTCCAC
AGCGCCGACC CCGACTGGGA GCGGCCCTAC CTCGACGGCT GGACCGTCCT CGGTTCCACC
ATCGACGGCC CGGCGATGGC GGCAGCCGCT CGACCGCTCC ATCGGGAGGA ACCGTTCGCC
GGGGTGCTGT GCTGGGACGA GGGTCGCATT CACGCCACCT CCTCCGTCGC CGCGGAACTC
GGCTGTCGCA ACGGCGACCC GACGGTGGTC TGGCGGCTGC GGGACAAGGC CCAGACCCGG
CAGGCGCTCG CCGCCGCCGG CGTACCCCAG CCGCGGTCGG TGCCGGTGAA CTCCGTGCAG
GAGGCCCTGC TGGCCGCCGC GGCGATCGGC TACCCGGTGA TCCTCAAGCC CCGCGGGCTC
GGTGCCAGCC TGGGAGTGGT CCGGGTCGAC GATTCCGACG GCCTACGCCG GTTGTTCCCG
TTCACCCACG GCACGACAGC CCCTGACCCG GTCCAGTACA GCACGGACGC TCCGGTCCTG
GTCGAGCAGT GCGTCTGCGG GACGGAGGTC AGCGTCGACG CGGTGGTGAC CGATGGGCAG
GTGGTGCCCC TGTTCGTGGC CCGTAAGAAG GTCGGCTACC CGCCGTACGC CGAGGAGGTC
GGGCACCTGG TGGATGCGGC GGACCCGTTG CTGCACGACG GCGCCTTCGT CGACCTGCTC
CAACGGACCC ACGGCGCCCT GGGCTTCCGG GACGGCTGCA CCCACACCGA GTACATGCTC
ACGGAGAACG GCCCGCAGTT GATCGAGGTC AACGGTCGAC TGGGTGGGGA CATGATCCCG
TACCTCGGGA AGCTGGCCAC CGGCGTGGAC CCCGGTCTGG TCGCGGCCGC GGCGGCGTGC
GGCCTGCCGC CGCGGACCGT ACCCACGCGT CGACGGGTGG CGGGGGTGAC CTTTTCCTAC
GTGGATTCGG ACGACACCAC CGTCACGTCG GTGACCATAG ACCACGCACG GCTTCCGCCG
ACCGTGGACC GCGTGGTGAC GCTGGTGTCC GCCGGTCAGG TGGTCTCACC GCCGCCGAAG
GGCACCGTGT GGGGGCGGAT CGCCTACGTC ACGGCAGTGG CCGACACCTG GCAGGAGTGT
CAGGCCGCGC TCGATGTGGC GACCAGGGCC GTGGGTTTCA CTCACCCATG A
 
Protein sequence
MHTAPTDHRP RLLLVASGMR PYREYLLASI ATAYRVHLFH SADPDWERPY LDGWTVLGST 
IDGPAMAAAA RPLHREEPFA GVLCWDEGRI HATSSVAAEL GCRNGDPTVV WRLRDKAQTR
QALAAAGVPQ PRSVPVNSVQ EALLAAAAIG YPVILKPRGL GASLGVVRVD DSDGLRRLFP
FTHGTTAPDP VQYSTDAPVL VEQCVCGTEV SVDAVVTDGQ VVPLFVARKK VGYPPYAEEV
GHLVDAADPL LHDGAFVDLL QRTHGALGFR DGCTHTEYML TENGPQLIEV NGRLGGDMIP
YLGKLATGVD PGLVAAAAAC GLPPRTVPTR RRVAGVTFSY VDSDDTTVTS VTIDHARLPP
TVDRVVTLVS AGQVVSPPPK GTVWGRIAYV TAVADTWQEC QAALDVATRA VGFTHP