Gene Franean1_3399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3399 
Symbol 
ID5671770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4029790 
End bp4031358 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content71% 
IMG OID641242287 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001507707 
Protein GI158315199 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCCAC CAGACTTCGC GACGCTGACC CCTGACAAGC CCGCAGTGAT CCTCGCCGGT 
GGCCCCCGCG GGCAGGAACG GATCCTGACC TACCGTGAGC TCGTCGAGGG CTCCAACCGG
CTGGCCCGGT TACTCGTCGA CTCGGGCCTG CGCCCGGGTG ACCGCCTCGC GATTCTCGCC
GAGAACCACC TGCGCTATTT CGAGCTGGTC TGGGCCGGCC TCAACTGCGG CCTCTACATC
ACCCCGGTGA ACTCGCATCT CACCCCGCCC GAGGTCGCGT ACCTGATCAA CGACAGCGGG
GCCAGGGCGT TGATCAGCAG CCGGGCGCTC GCGGCGGTCG CCGAGGCCGT CGTCCCTGAG
ACCCCCGGGG TCGTCCGGCG CCTCATGCTC GACGGGGGCT CCGAGCACTA CGAGGATCTC
GACGCCGCCA CCGCCGGCTT CTCCGCCGAG CCCCGGGACG ACGAGATCCG CGGCACGTTC
ATGCTCTACA GCTCCGGGAC GACCGGCCGG CCGAAGGGAA TCCAGTTCCC GCTGCCCGAC
TGGCCCGCCA GCGAGGGCGA CGCCCCGCTG CTTCCCGGCG CCCGCGGGGC CTTCGGCTTC
AACGCCGAAG CGGTCTACAT CTCCCCCGCG CCGCTCTACC ACGCAGCCCC GCTACGGGTG
TCCGCGCTGA TGCACAGCGT CGGCGGCACG GTCGTCGTCC TCCCGAAGTT CGACGCCGAG
GGGGCCCTGC ACGCGATCGA ACGGTACCGG GTGACCACCT CCCAGTGGGT CCCGACGATG
TTCGTCCGGA TGCTCAAGCT GCCACCCGAG GTCCGCGCCC GCTACGACCT GTCGAGCCTG
CGGATCGCCG TCCACGCGGC CGCGCCGTGC CCGGTCGAGG TCAAGCGCCA GATGATCGAG
TGGTGGGGCC CGATCATCTT CGAGTACTAC TCCGGCTCGG AGAACGTCGG CAGCACCGGG
CTGACCAGCG AGGAGTGGCT GGCGCACCCG GGTTCGGTCG GCCGTGCGCA GGGCGGCGTG
CTGCACATCT GCGGCGAGGA CGGCGCGGAG CTCCCGGCCG GCCAGGACGG AGCCGTCTAC
TTCGAGGCCA AGGGCGCGGG CTTCAATTAC CACAACGACC CGGACCGCAC CAGAGCCGTC
AGCCATCCCG CCCACCCCGG CTGGCGCACC CTGGGCGACA TCGGCCACGT CGACGAGGAC
GGCTACCTCT ACCTCAGTGA CCGGAAGGAC TTCACGATCA TCGCGGGCGG GGTGAACATC
TACCCGCGCG AGATCGAGGA CGTCCTGGTG CTGCACGACG AGGTCGTGGA CGTCGCCGTG
TTCGGCGTGC CGCACCCCGA GCTCGGCGAG CAGGTCAAGG CCGTCGTCCA GCCGGTGCGG
ATGGCCGACG CCGGCGACGG GCTGGCCGCG CGGCTGCTGG AGCACTGCCG CACCAGGCTC
GCGCCGTTCA AGTGGCCGCG GTCGATCGAC TTCGTGCCCG AACTCCCCCG CCTGGACAAC
GGCAAGCTCT ACAAGAAGCC GCTGCGCGAC GCCTACTGGG CCACCGAGGC CTCCACCGTC
GCCCGTTAG
 
Protein sequence
MYPPDFATLT PDKPAVILAG GPRGQERILT YRELVEGSNR LARLLVDSGL RPGDRLAILA 
ENHLRYFELV WAGLNCGLYI TPVNSHLTPP EVAYLINDSG ARALISSRAL AAVAEAVVPE
TPGVVRRLML DGGSEHYEDL DAATAGFSAE PRDDEIRGTF MLYSSGTTGR PKGIQFPLPD
WPASEGDAPL LPGARGAFGF NAEAVYISPA PLYHAAPLRV SALMHSVGGT VVVLPKFDAE
GALHAIERYR VTTSQWVPTM FVRMLKLPPE VRARYDLSSL RIAVHAAAPC PVEVKRQMIE
WWGPIIFEYY SGSENVGSTG LTSEEWLAHP GSVGRAQGGV LHICGEDGAE LPAGQDGAVY
FEAKGAGFNY HNDPDRTRAV SHPAHPGWRT LGDIGHVDED GYLYLSDRKD FTIIAGGVNI
YPREIEDVLV LHDEVVDVAV FGVPHPELGE QVKAVVQPVR MADAGDGLAA RLLEHCRTRL
APFKWPRSID FVPELPRLDN GKLYKKPLRD AYWATEASTV AR