Gene Sare_2388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2388 
Symbol 
ID5705824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2744288 
End bp2745919 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content74% 
IMG OID641271866 
Productapolipoprotein N-acyltransferase 
Protein accessionYP_001537237 
Protein GI159037984 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0815] Apolipoprotein N-acyltransferase 
TIGRFAM ID[TIGR00546] apolipoprotein N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0733975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000659561 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGGCGA CGGCCGTGGA GACGACGCAG GAACGCCGCC TACCGTTGCC GGCGGCCGTG 
GTGGCGGCGG TCGTCGGGGG CGTGCTGATG CTGCTGGCGT TCCCGCCGTA CGGGGTGTGG
CCGCTGGCGG CGGTCGGGGT GGCCCTGCTG GCCGCCGCGA CGCACCGCCG CCGCCCGCGT
GCCGGCGCCG GTCTCGGGTT CCTGGCTGGG CTGGCACTGT TCGCCCCACT ACTGAGCTGG
ACCAACCTGC ATACCGGTTA CCTGCCGTGG GTGCTGCTGT CGCTGGTGCA GGCCAGCTAC
CTGGCGCTGC TCGGCGCGGC CTCCGCGTGG GTTTCTCCGC TGGTTGACCG GTTCCGGTGG
GGGTGGCCGG TGTTGACCGG GGTGCTGTGG GTGGCGCAGG AGGCGCTGCG GGACCGCACC
CCGTTCGGGG GTTTCCCGTG GGGTCGGCTG GCGTTCAGCC AGGACGCGTC GCCGCTGCTG
CGGCTGGCGG CCCTGGGTGG GGCGCCGCTG GTCACCTTCG CGGTCGCGGT GGTCGGTGGC
CTCCTGGTGG CCGCCGCCTG GGCGGGCTGG GCGGCGTACC ACCGCCCTGG TGGGCGCCGG
TCGCCGGGTG GGTGGGCGCC GGTCGCCGGG TGGCTGGCGG CCGCGCTGGC GGTGCCGGTC
ACCGGGTTGC TGGTGCCGAT GCGCGCCGCC GGTGACGGGG ACACGGTGAC CGTGGCGATC
GTGCAGGGTA ACGTGCCGCG GCTGGGGTTG GACTTCAACG CCCAGCGGCA GGCGGTGCTG
AACAACCACG TCGAGGCCAC CGTCGAGCTG GCCGCGCAGG TGGCCTCCGG CGCTCGGGCC
CGCCCCGACC TGGTGGTGTG GCCGGAGAAC TCCAGCGACA TCGACCCGCT GCGGGACGCC
TCCGCCGGTG GGCGCATCCA GGCGGCAGCG GACGCCGTCG GGGTGCCGAT CCTCGTCGGC
GCGGTGCTGC GCGGCCCCGG TCCGGGTCAG GTACGCAACG CGGGGCTGCT GTGGCAGCCG
GTGACCGGCC CCGACCTTGA CCAGGTGTAC ACGAAACGAC ACCCGGTGCC GTTCGCCGAG
TATGTGCCGT TACGTGACAT AGCCCGGCTG GTCAGCAAGC AGGTGGACCG GGTGCGATCC
GACTTCGTGC CGGGCACCAC GCCGGGTGTG CTGGCCGCCG GTCCCGCGGT GCTCGGTGAC
GTCATCTGCT TCGAGGTCGC CTACGACGAG GTGGTCCGGG ACACCGTCAC CGGCGGGGCG
CAGCTGCTGG TGGTGCAGAC CAACAACGCC ACCTTCGACG TGGCCGAGGC CCGTCAGCAG
CTGGCCATGG TGCGGCTGCG GGCGGTCGAG CACGGCCGGC CGGCGCTGAT GGCCTCGACG
GTGGGCGTTT CCGGGTTCGT CTCCCCGGAC GGGCGGGTAA GCGATGCCAC CGGGTTCAAC
ACCCGCGCGA TCGTGGTGCG ACAGCTGCAC CTCGCCGACG GACGCACCCT CGCAACTCGG
GTCGGGTTGT GGCCGGAGGT GGTGCTGACG GTCCTGGCCG TGGCGGCCCT GGCCGCAGCG
GGGATGCGGC GGCGTGTCCG CGGTGGCCCG ACGGCCGGTA CGCCGGCGAG AGGGGGGCAC
AACGGTGCGT GA
 
Protein sequence
MPATAVETTQ ERRLPLPAAV VAAVVGGVLM LLAFPPYGVW PLAAVGVALL AAATHRRRPR 
AGAGLGFLAG LALFAPLLSW TNLHTGYLPW VLLSLVQASY LALLGAASAW VSPLVDRFRW
GWPVLTGVLW VAQEALRDRT PFGGFPWGRL AFSQDASPLL RLAALGGAPL VTFAVAVVGG
LLVAAAWAGW AAYHRPGGRR SPGGWAPVAG WLAAALAVPV TGLLVPMRAA GDGDTVTVAI
VQGNVPRLGL DFNAQRQAVL NNHVEATVEL AAQVASGARA RPDLVVWPEN SSDIDPLRDA
SAGGRIQAAA DAVGVPILVG AVLRGPGPGQ VRNAGLLWQP VTGPDLDQVY TKRHPVPFAE
YVPLRDIARL VSKQVDRVRS DFVPGTTPGV LAAGPAVLGD VICFEVAYDE VVRDTVTGGA
QLLVVQTNNA TFDVAEARQQ LAMVRLRAVE HGRPALMAST VGVSGFVSPD GRVSDATGFN
TRAIVVRQLH LADGRTLATR VGLWPEVVLT VLAVAALAAA GMRRRVRGGP TAGTPARGGH
NGA