Gene Francci3_2982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2982 
Symbol 
ID3905478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3531549 
End bp3533045 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content75% 
IMG OID637880302 
Productacyl-CoA synthetase 
Protein accessionYP_482068 
Protein GI86741668 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.186331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.190971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCTGC TCCCACTGCC TGCCGGCGAG CACGACGGCC CCGCCGTCCG GGTCGGGGAG 
GTCGAGTTCA CCCGCGCAGA GCTCTTCGCG GCCGCGTCCG TCGTCGCCGG TCGGGTGGCC
GGCGCGCCGG CGGTGGCCGT GCACGCCGAG GCGACGATGG CCACCGTCGT CTCGGTCGTC
GGCTGCCTGC TCGCCGGGGT GCCGGCGGTA CCCGTGCCGC CTGACTCCGG GCCGCGGGAA
CGCGGTCACA TCCTGCGCGA CTCGGGCGCC GCCCTGCTGC TCGGCAGGCC CGCCTGGGAC
AACCTCGCGA TCCCCACCGT GCCGGTCGAT CTCACCGCAC GGTCGGGGTC TGCCGGCTCC
GGCTCCGGGT CCGGGTCCGG GGAAACCGGA CCGGCGCTCA TCCTCTACAC CTCCGGGACG
ACCGGAGCCC CCAAGGGCGT GGTGCTGTCC GCCCCGGCCA TCGCCGCCGA TCTGGACGCC
CTCGCCGACG CCTGGGCCTG GACGCCCGAG GACACGCTCG TGCACGGGCT GCCGCTGTTC
CACGTCCACG GCCTGGTCCT GGGTGTGCTC GGGGCGCTGC GGGTCGGCAG CCGGTTGATC
CACACCGTCC GCCCGACCCC GACGGCGTAC GCGGCGGCCG GGGGGACCCT GTACTTCGGC
GTGCCGACCG TGTGGTCCCG GGTCTGCGAC GATCCGACCA CCGCCCGCGC CCTGGTCTCG
GCCCGGCTGC TCGTCTCGGG CAGTGCCCCC CTGCCGAGGC CGGTGATCGA CCGGCTCACC
GGGCTCACCG GCCTCGCCCC GATCGAACGG TACGGGATGA CCGAGACGTT GATCACCATC
AGCGCCCGGG CGGACGGGGA GCGCCGGGCG GGCTGGGTCG GTACCACCCT GCCGGGGGTG
CGGGCCCGGC TCGTGGACGA CGAGACGGGG ACCGAGCTGC CCGCGGACGG GGAGAGCATC
GGCGAGTTGC AGGTCCGCGG TGCCACCCTG TTCGACGGGT ACCTGGGGCG CCCGGAGGTC
ACCGCCGCGT CGTTCACCGC GGACGGCTGG TTCCGCACCG GTGACGCCGC GGTCGTCGCC
CCGGACGGCC ACCACCGAAT CGTCGGGCGC CGATCCACCG ACCTCATCAA GAGCGGCGGT
TACCGGGTGG GCGCGGGCGA GGTCGAGGCC GTACTGCTGG CCCACCCGGC CGTGCGCGAG
GCCGCCGTCG TCGGGCTGCC GGACGACGAC CTCGGGCAGC GCATCGCCGC CTTCGTCGTC
GCCCCCGACC TGGCCGGTGC GGCGGGCGGG GCACCAGGCG GGATACCGGG CAGGGCGTCG
GACGGGACAC CGAGCGAGAC GGCAAACGAG ACGGCGAGCG AGGCGCTCAT CGACTTCGTG
GCGCGGGAGC TGTCCATCCA CAAGCGGCCC CGGGAGATCC ACCTGGTGGC CGAGCTCCCC
CGCAACTCGA TGGGCAAGAT CCGCAAGTCC GCCCTGGCTC CCCCGGAAAC GCCCTGA
 
Protein sequence
MTLLPLPAGE HDGPAVRVGE VEFTRAELFA AASVVAGRVA GAPAVAVHAE ATMATVVSVV 
GCLLAGVPAV PVPPDSGPRE RGHILRDSGA ALLLGRPAWD NLAIPTVPVD LTARSGSAGS
GSGSGSGETG PALILYTSGT TGAPKGVVLS APAIAADLDA LADAWAWTPE DTLVHGLPLF
HVHGLVLGVL GALRVGSRLI HTVRPTPTAY AAAGGTLYFG VPTVWSRVCD DPTTARALVS
ARLLVSGSAP LPRPVIDRLT GLTGLAPIER YGMTETLITI SARADGERRA GWVGTTLPGV
RARLVDDETG TELPADGESI GELQVRGATL FDGYLGRPEV TAASFTADGW FRTGDAAVVA
PDGHHRIVGR RSTDLIKSGG YRVGAGEVEA VLLAHPAVRE AAVVGLPDDD LGQRIAAFVV
APDLAGAAGG APGGIPGRAS DGTPSETANE TASEALIDFV ARELSIHKRP REIHLVAELP
RNSMGKIRKS ALAPPETP