Gene Franean1_1868 details


Gene Information

Locus tag: Franean1_1868
Symbol:
ID: 5670270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2243955
End bp: 2245592
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 74%
IMG OID: 641240790
Product: apolipoprotein N-acyltransferase
Protein accession: YP_001506212
Protein GI: 158313704
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0815] Apolipoprotein N-acyltransferase
TIGRFAM ID: [TIGR00546] apolipoprotein N-acyltransferase
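
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START_BP = 2243955
END_BP = 2245592


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1638
print(protein_length(length_bp))  # 545
```

Both values agree with the Gene Length and Protein Length fields of the record.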


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0610408
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGCA GCGCCGGCCG CCCCGTGTCC GTCGACGCGG GCGGGCATGT GCGGCTGGTG 
CCGACCCGGT GGTCCCGGTG GCGGTACCGG GCGCTGTCCG CGTTGCTCGG GGCGGTGCCC
GCCGTGGCGT TCCCCGCGCT GTCGGCATGG CCGGTCGGCT TCGTCGGGAT GGTGCCGGCA
ACCCTGGTGA TCGTGGCGGC CACCGTCCCG CGGGAGGCGG CGATCCGTGC CTGGTGCGGC
GGTACCGGGT TCTTCCTGGC GACCTGCTAT TGGCTGGTTC CGAACACCGG TCCGTTCATC
GTCGTCCTCG GCCTGGCGCT CGGCGTCACC TGGATGCTGT GGGGCGTCCT GGTCTGGACG
GCGCTGCGCC CCCGGCTGCC CTCCCAACCG CCCGGGTACC GCCGTCTCGC CTGGGCGCTG
GTCGCCGTCC CGTCCGGCTG GGTGATCGGG GAGTTCGCCC GGTCGTGGGA GGGCTTCGGT
GGGCCGTGGG CGCTGCTCGG TGCCAGCCAG TGGAACGCCC GACCGTTTCT GCCGCTGGCC
GCCGTCGGCG GGGTGTGGCT GCTGAGCTTC CTGCTCGCCG CCGTCAACCT GCTGGTGGCT
GCGGCGGTCA TGCCGGGCCT GCGGCCGGGG CGGCGGCGAC CGTGGCGGGC CGGCGTCGCG
CTAGCAGCCG GCCTGCTCGT TGCCGTGATG GTGGCCGGTG CGGCGGCCGT GCCCACCCCG
GCCAACACCG GCACGCTCAC CGTTGGTGGC GTCCAGCCAG GCGTCGTCCA CGGTGCCGAC
GTGCGGTTCG CCGACGGTGA GGCGGCCACC AGAAGCCTGG TCGGCGCCGG GGTCGACCTG
GTGGTGTGGG GGGAGAGCAG CGTCGGGTTC GACCTCGTCG ACGACCAGGC ACGGCTGCGC
CAGCTCGAGG ATCTGTCCCG CATGCTCGGT GTCCCGGTCT TGGTCAACAC CGATGCGCGG
CGTGCTGATG AGGTCGCCGG CCCCGACGAT GGGGACGGGG GGATCTACAA GTCGGCAGTG
CTGGTCGGAC CGGACGGCCC GCGCGGGCGG TACGACAAGA TGCGGCTGGT GCCGTTCGGG
GAGTACATCC CCCTGCGCCC CGTCTTCGGT TGGCTGACCG CGGTGACCGA GGCAGCCGCC
GAGAACCGCC GCAAGGGTGT CCGGCTTACC GTGCTCGCCG CGGGGAAACT GGACGGCAGG
ACGATCCGGC TCGGTCCGCT GGTCTGCTTC GAGTCGGCCT TCCCGGACAT GACTCTCCGT
CTGGCGAACG ACGGCGCGGA CGTGGTCGTC GTCCAGTCGG CCACCTCGAC CTTCCAGGAC
AGCTGGGCAC CCGATCAGCA CGCCAGCCTC GCCGCGCTGC GGGCGGTCGA GGCGGGACGG
CCTGTCCTTC ACGCCACGCT GACCGGCGTG TCCACCGCCT TCGACGCCTC CGGCCGGCAG
TTGTTCCGCC TGGGCAGGGA CGGGCGCGGC GCCTATGTCG TAGACCTGCC GCTGACCAGC
GCGACCGGCA CCCCGTATGC CCGCCTCGGC GACTGGGTGC CCCTCGGCTC GCTCGCCATC
GTCGCGGCCG TCGCGCTCGA TGCCGCGGTG CTGCGGGCGG TCCGTCTGCA CCGCGCCAGG
CGCTCCACGA CGGGGTGA
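
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any statistic such as GC content is computed. The following is a minimal sketch of that step. The helper names are hypothetical, and only the final line of the sequence is used as a demonstration, so the printed value is not the whole-gene figure reported in the record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Final line of the gene sequence, copied from the record.
last_line = join_blocks("CGCTCCACGA CGGGGTGA")
print(len(last_line))         # 18
print(gc_percent(last_line))
```

Passing the full gene sequence through `join_blocks` and `gc_percent` would give a value to compare against the GC content field.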
 
Protein sequence
MASSAGRPVS VDAGGHVRLV PTRWSRWRYR ALSALLGAVP AVAFPALSAW PVGFVGMVPA 
TLVIVAATVP REAAIRAWCG GTGFFLATCY WLVPNTGPFI VVLGLALGVT WMLWGVLVWT
ALRPRLPSQP PGYRRLAWAL VAVPSGWVIG EFARSWEGFG GPWALLGASQ WNARPFLPLA
AVGGVWLLSF LLAAVNLLVA AAVMPGLRPG RRRPWRAGVA LAAGLLVAVM VAGAAAVPTP
ANTGTLTVGG VQPGVVHGAD VRFADGEAAT RSLVGAGVDL VVWGESSVGF DLVDDQARLR
QLEDLSRMLG VPVLVNTDAR RADEVAGPDD GDGGIYKSAV LVGPDGPRGR YDKMRLVPFG
EYIPLRPVFG WLTAVTEAAA ENRRKGVRLT VLAAGKLDGR TIRLGPLVCF ESAFPDMTLR
LANDGADVVV VQSATSTFQD SWAPDQHASL AALRAVEAGR PVLHATLTGV STAFDASGRQ
LFRLGRDGRG AYVVDLPLTS ATGTPYARLG DWVPLGSLAI VAAVALDAAV LRAVRLHRAR
RSTTG
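
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG-ordered amino acid string. Translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are translated.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = (
    "ATGGCCAGCA GCGCCGGCCG CCCCGTGTCC "
    "GTCGACGCGG GCGGGCATGT GCGGCTGGTG"
)
print(translate(first_line))             # MASSAGRPVSVDAGGHVRLV
print(translate("CGCTCCACGA CGGGGTGA"))  # RSTTG
```

The two outputs match the first twenty and last five residues of the protein sequence above.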