Gene Francci3_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2049 
Symbol 
ID3904622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2410435 
End bp2412078 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content68% 
IMG OID637879386 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_481152 
Protein GI86740752 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.998714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.345074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCCC AGAACAGCGT CGTGCACGGC TTGGTCCAGG CGGCCGGGCG GTGGGACCCA 
ACCATCCACG ACCATGCGGC GGACACCAGG GTCGGACTGG ACGCTCTGCT GGACACGGCG
CTCAGCAATG CCTCCACGCT CGCCGGTCGG CAGGGCGAGA CGGGCCAGCT CCGGATCGGC
ATCCTCATGC CGAACAGCCT CGCCTGGTTG GAGGCGCTCA TCACCACGCT GGCCGCCGGG
TCGGCGGGCG TGCCGCTGCC ACTGCCGAGC GGCTTCGGCG GGCCGCAGGC GTATATGGAC
CACATCTCCC TCTTGGCCGA CACCGCCAGA CTCGACGCGA TCATATACAA TGCCGCAGAC
CTCGCACCGA CGGTACGCGC CCTCAGGAGC CGCCTGAACG GGGTCGAGTT CCTCGACATC
TCAGGCTGGC CCACAGCACG GCCGGCCAGC GTCACCGAGG CGGCCGATGA TCCACGGATC
ATTCAGTTCA CCTCGGGCAG CACATCACGG CCCAAAGGCG TCATCCTGAC CGCCGCCAAC
ATCTCGGCCG CGGTCGCGAT CCTGGCCGAG CACTTCTTTC TCACTCCTAC CGACGCCCTG
GGAAACTGGC TGCCGTTCTT TCATGACATG GGGCTCTTCA TGACCTTGGC GGCGCTCACC
CACGGGTCCA GCCTGCATCT GTGGACGCCA AGCCAGGCCG CGCGCCGCCC GCTGGCCTGG
CTCCGCCAGT TCGCCGAGAA CCGGTGCACC GTGGCGGCGG CTCCCAATTT CTTCTACAGC
CAGCTGGCCG ATGCGGCGGC CAAGGAAGGC ACGCCGGCTG ACCTCGACCT GTCCACCTGG
CGCGTCGCGA TCAATGGTTC CGAGACAGTG CGGGCCGACA CCATAGAACG CTTCACCAGG
GCGTTCCGGC CGGCGGGCTT CCACGAAGCG GCGATGTGGC CGTCCTACGG GCTGGCGGAG
GCGACGCTGC CGGCCGCGAT CCATAGGCCG GGCCTGGGCT TCACCACCCG CGCCGTCGCA
CGCGGGGACC TCGCACCGGG GGAACCTGTG CGTTTCACGG CGGTGGGCGC CCCCGGATCG
CGAACGGTGG TCGGCTGCGG ACGGCAGCTA CGCGGGACCG GTCTGCGGGT AACGGACCCA
CATGGGAACC CGCTGCCCGA GGCCCATCTT GGCGAGATCC AGCTGCGCAG CCCAACCGTG
ATGGCCGGCT ATCTCGACCG GCCGGCGGCC GAGGCACCCG TGACATCCGA AGGCTGGCTG
ATAACCGGGG ACCTCGGCTT CCTCAGCGAC GGCGAGCTCT TCATCACAGG AAGGACCAAG
AACGTAGCAA TCATCAATGG CCAGAACGTC TATGCCGAGG ACCTCGAACA CCTGGTAAGG
GACGCGCTCG GCGACCAGGT TCGCTGCGGG GTCACAGCCG GCATGGATGA AGAGGACCGC
GAGTTCATTC TGATCTGCTT CGAGCACTCG GGCACTTATG AGGAGCAGAG CGAGGCGGTC
ACCTTGGTGC GCAACCAGGT CTCCGCGGCC CTCGGCGGAT TCCGCGCGAC CGTTGTCGCA
CTACCTGACC GCCAGCTTCC ACACACGACC TCCGGGAAGA TTCGCCGAGC TGCCCTGGCG
GACGTGGCGG GACGATACCT CTGA
 
Protein sequence
MGAQNSVVHG LVQAAGRWDP TIHDHAADTR VGLDALLDTA LSNASTLAGR QGETGQLRIG 
ILMPNSLAWL EALITTLAAG SAGVPLPLPS GFGGPQAYMD HISLLADTAR LDAIIYNAAD
LAPTVRALRS RLNGVEFLDI SGWPTARPAS VTEAADDPRI IQFTSGSTSR PKGVILTAAN
ISAAVAILAE HFFLTPTDAL GNWLPFFHDM GLFMTLAALT HGSSLHLWTP SQAARRPLAW
LRQFAENRCT VAAAPNFFYS QLADAAAKEG TPADLDLSTW RVAINGSETV RADTIERFTR
AFRPAGFHEA AMWPSYGLAE ATLPAAIHRP GLGFTTRAVA RGDLAPGEPV RFTAVGAPGS
RTVVGCGRQL RGTGLRVTDP HGNPLPEAHL GEIQLRSPTV MAGYLDRPAA EAPVTSEGWL
ITGDLGFLSD GELFITGRTK NVAIINGQNV YAEDLEHLVR DALGDQVRCG VTAGMDEEDR
EFILICFEHS GTYEEQSEAV TLVRNQVSAA LGGFRATVVA LPDRQLPHTT SGKIRRAALA
DVAGRYL