Gene Franean1_6141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6141 
Symbol 
ID5674462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7470434 
End bp7472833 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content75% 
IMG OID641244993 
Productacyltransferase 3 
Protein accessionYP_001510391 
Protein GI158317883 
COG category[I] Lipid transport and metabolism 
COG ID[COG1835] Predicted acyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGCCG GTTCGCTTCC GATGACCGAC GGCCCGCCGG GGGCGGCCAG GCGGGCGAAG 
CCCACCCCGC GCCCGCCGGC CGGGCCAGCC GAGACCCGGC AGCCCACCGA CGCGGAGGCG
ACGGCGGCGG CGAAGTTCGC CTACAACCCG GCGCTCGACG GTCTGCGCGT GATCTGCATC
TACATCATCC TCGCCGGTCA CATGGGCGCC ATCCACGCCA GCAACGTGGC CGTCGACATG
TTCTGCGTGC TCAGCGGGTT CCTGATCACG ACCCTGCTGC TGGCCGAGCA GGCCCGCACG
GGCACCGTCT CGATCGGGCG CTTCCTCGTC CGGCGTGCGT ACAAGCTGAT GCCGGTGATG
TGGGTCTACC TTCTGGTCGG CCTGGCCATC ACGGTCGCCT TCAAGTGGGA TGACATCCCA
TACCGTGACG ACTACATCAA GAGCGCCCTC TCGACGTTCC TCAACGTCAA CAACTGGTAC
AAGGTCGAGA ACCCGCTGGG CGGCGGGCGC TGGCTGGCCC ACGTCTGGTC GTTGTCGATG
GAGGAGCAGT TCTACCTGGT CTGGCCGTGG GTCTTCCTGC TCTTCGTCCG CTCCGCGCGG
CTGCGCCCGT ACCTGTTGAC GTTCCTGATC GCCTCGATCG GCCTGATCAT GGGCTGGACG
TACATGATGG CCGCCAACGG GGCCCCGCGC AGCCGGGTCT ACCTCGCGCC CGACACCCAC
ATCGCCCCGC TGCTCATCGG CTGCCTCGTC GCGGTCTGGC GGGACAACCG GCTGCGCGCG
CTGGCGACCC CGGTGGTCCG CGACAGGAAG GACGGCGACA GGAAGGACGG CGGCGGGAAG
GGCGACGGTA AGAAGGCCGC GACCGGAGCT CACTCCGCGG CGACCGCCGC CGCCGTCGAA
CGCTGGACGA GCGGACGGCG ACTGGCCGCC CTCGGCCTTC CCGCCGGCAT CGTGCTGTTC
CTGCTGGCCT TCCTCGGGCC GAACAAGGAT CTCCCGGAAC CGAACTGGAT CGACTACGGG
GCCTACGTGC CGAGCGCGGC CCTGGGCGCG TTGCTGATCA TCGGCGCCGA CGTCAACCGG
GACGCCCGGT GGGTGCGGCT GCTCGGCTCG CCGAAGATGG CCTGGACCGG AAAGATCACC
TACAGCATCT ACCTGTGGCA CTACCCGTTC ATCTCGGCCG CCGCCGGCCA GCTCGTGCCG
CGGATCGGGC TCTGGCCGTC GGTGGTCGTC GCCGCAGTCT GCACCACGAT CACGGCCTAC
TTCTCGAACC GCTTCATCGA GAAGCCGGTC ATAGCGCGCC GTCCGAAGTG GGCGGACACC
CCGCGTGGCC CGGCCCGCCC CGCCGCCGGC GCGGGCCCGG CGGGGGCCCC CGCCCAGGCG
CCGGCCAAGG CCGGCCCGCG GGAGCCGCGG GAGCGGGACC TGTCCGAGCT GCCCGAGCTG
GAGCCGGTGC TCGCCGGTGT CGGCACGAGC GCCGCCGACG CCGAGCAGGC GGACGGGCGC
GGTGGCCGGG CCGGCGCCCC GCCCCGGCCG GGAGACTGGT TGGACAGCGA CGTCGACTGG
GTGGACGAGG GTTACCCGGG CCGCGGCGGC CCGGCCGGGC ACGGGCCGGC CCGGTCCGGC
TACTCCCACG ATCCGGATTC GCAGCCGATG CCGGCCGTTC CGCGCCCGAG CGGGCCCCGG
GCCGGCGAGG TGGTCTACGA CCGCGCCGAC GGGCCACCCG TGTACGAACA CGGGCCGGCG
GTGGGCGCTA CGCACGCCGG CGGGTTCGAG CCGACCCCCA TCCCGGACTG GGCGGCCTAC
CCGGCCCTCA ACCGCGGCCC CGGATCGGCC GCGGACGCGG GCCCGGCCAT GGGTGCTCCC
CGGCAGGGGC CGGCGGCGTC GATGGGCGGC GACACCATGA ACCTCCACCT TCCCTCGACG
TTCGACCCCG GCGCGCCCCC TGCCCACGGG ACCGGCCCGC GGGCCGGGCA CGACCGCCCC
GGCCGTCCGG CCGATCCCGC CGAGCCCGGC CGCCTGCCCG GGGTCGCCCA TCCGGCCGGC
CACCGGCCCG GCCCGGCACC CGGCCACGCG CCCGCGTACG CCCACGGCCA GGCCCTGGGC
CACGTGCCGG GCCACGGTCA CGTGCCGGGG CCCGGTCATG CGCCGGCGTA TGGCCACGGC
TCGGGGAACG GTCATGGCCC GGGGAACGGC CACGGCCCGG CATACGGGCG GGGGCCCGGG
GAGAGCAGGG GTCCCGGGCG TGACCAGGGC CTCGACCGTC TCGACGGAGC CGGGCACGGC
CCCGCAGACC ACGCCGGGGA CCACGTGCTC GAGCCCGTCC GGGATCCGCG CGCCGGGCGG
GAGCCCGCCG AGCCCGACCC GCTGTTCGGG CCGGTCCCGG GCGCGGGGCG CGACTGGTGA
 
Protein sequence
MRAGSLPMTD GPPGAARRAK PTPRPPAGPA ETRQPTDAEA TAAAKFAYNP ALDGLRVICI 
YIILAGHMGA IHASNVAVDM FCVLSGFLIT TLLLAEQART GTVSIGRFLV RRAYKLMPVM
WVYLLVGLAI TVAFKWDDIP YRDDYIKSAL STFLNVNNWY KVENPLGGGR WLAHVWSLSM
EEQFYLVWPW VFLLFVRSAR LRPYLLTFLI ASIGLIMGWT YMMAANGAPR SRVYLAPDTH
IAPLLIGCLV AVWRDNRLRA LATPVVRDRK DGDRKDGGGK GDGKKAATGA HSAATAAAVE
RWTSGRRLAA LGLPAGIVLF LLAFLGPNKD LPEPNWIDYG AYVPSAALGA LLIIGADVNR
DARWVRLLGS PKMAWTGKIT YSIYLWHYPF ISAAAGQLVP RIGLWPSVVV AAVCTTITAY
FSNRFIEKPV IARRPKWADT PRGPARPAAG AGPAGAPAQA PAKAGPREPR ERDLSELPEL
EPVLAGVGTS AADAEQADGR GGRAGAPPRP GDWLDSDVDW VDEGYPGRGG PAGHGPARSG
YSHDPDSQPM PAVPRPSGPR AGEVVYDRAD GPPVYEHGPA VGATHAGGFE PTPIPDWAAY
PALNRGPGSA ADAGPAMGAP RQGPAASMGG DTMNLHLPST FDPGAPPAHG TGPRAGHDRP
GRPADPAEPG RLPGVAHPAG HRPGPAPGHA PAYAHGQALG HVPGHGHVPG PGHAPAYGHG
SGNGHGPGNG HGPAYGRGPG ESRGPGRDQG LDRLDGAGHG PADHAGDHVL EPVRDPRAGR
EPAEPDPLFG PVPGAGRDW