Gene Francci3_3097 details

Gene Information

Locus tag: Francci3_3097
Symbol:
ID: 3904223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3668299
End bp: 3670098
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 71%
IMG OID: 637880418
Product: AMP-dependent synthetase and ligase
Protein accession: YP_482183
Protein GI: 86741783
COG category: [I] Lipid transport and metabolism
COG ID: [COG1022] Long-chain acyl-CoA synthetases (AMP-forming)
TIGRFAM ID:
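
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3668299, End bp 3670098.
length = gene_length(3668299, 3670098)
print(length, protein_length(length))  # prints "1800 599"
```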


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.658552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.476459
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCGAGT ACACCTCCCC GCCCCTGGTC ACCATCGCCG ACGATGCGAC CCTGACGGAC 
GCCGTGTTCC GCAACGCGGC GGCACATCCG GACAGTACGC TTATCCAACA CAAGATCGAC
GACGAGTTCG TCGGCATGAC CGTCCGCGAG TTCCACGATC ACGTCGTGGC CACCGCCCGG
GGCTTCATCG CCCGCGGGGT GCGGCCCGGG GACCGCGTCG GTCTGGCGAG CCGGACCCGG
TTCGAGTGGA CGATTGTCGA CTACGCGGCG TGGCTGGCCG GAGCCGTCTG CGTGCCGATC
TACGAGACCT CCAGCCCGGG CCAGATCGAG TGGATCCTGC GGGACGCCGG CGTCGAGGTT
CTCGTGGTCG AAAACGACGA GCTCGCCGAG CGGGTCGCAC AGATACGCGA CGGGGTCCCC
GCGCTGCGGG AGGTCCTCGT CATCGAGCAC GGGGCGCTGG CCGGTCTCGC CGTGGACGGC
GCAGGCATCG CGCCGGAGCG GCTGACGGCG GCCCGGGCGT CCGTCAACGC CGACAGCCTC
GCGACGATCA TCTACACCTC GGGGACCACC GGCCAACCGA AGGGCTGCGA GATCACCCAT
CGCGCCCTGC TGTTCACCGC CGAGGCGGCC ATCGCCACGC TGCCCGAGCT GTTCGCGCCC
GGTGCGTCCA CACTGCTGTT CCTCCCGCTG GCGCACGTGT TCGCCCGCAT GCTCCAAGTG
GGCGTGGTCC AGGGGGCGTT CACCCTCGCC TACACCCCGG ACTCGCGGAC CCTGCTGCCC
GATCTCGCCA AGGTACGCCC GACCTTCCTG CTCTCCGTGC CCCGGGTGTT CGAGAAGGTG
CACGCCGGGG CACGCCACAA GGCGCACGCC GAGGGCAGGG GTTGGATCTT CGACGCCGCC
GAGAACACCG CGGTCGCCTA CAGCCGGGCT CTCGACGGCG GCGGCCCCGG CCTGCTCCTG
CGGCTGCGGC ACCGGCTGTT CGCCGCGCTG GTCTACGGCA AGCTCCAGGC TGCGCTCGGT
GGGCGGGCCC GCTACGCGGT CAGCGGCGGG GCGCCGCTGG GTGAACGCCT CGGCCACTTC
TTCCGCGGGA TCGGGTTCAC GGTGCTCGAA GGCTACGGTC TGACCGAGAC CAGCGCGCCG
GCCGCGGCGA ACCGGCCGGG CAACGTCCGC ATGGGCACCG TCGGCCAGCC CTTTCCTGGC
GTGACGATCG CCATCGCCGA TGACGGGGAG ATCCTCATCC GCGGTCCCCT GCTGTTCCGC
GGCTACCGCA ACAACGAGCT CGCGACGAAG GAGGCGCTCG ACGCCGAGGG CTTCCTGCAC
ACCGGCGACC TCGGCGACCT CGACGCCGAC GGCTTCCTGC GGATCACCGG CCGCAAGAAG
GAGCTGCTGG TCACCGCCGG TGGGAAGAAC ATCGCACCGG CGCCACTGGA GCACATCATC
CAGTCCCATC CCCTGGTCAG CCAGGCGATG CTGATCGGGG ACCGGCGGCC CTTCGTCGCC
GCCCTCGTCA CGCTCGATCC CGAGGCGTTC GACCGGTGGC GGTCCTCGGC GGGCAAGCCG
GCCGGCGCGA CGGTTGCCGA CCTGATCGAC GACGCCGGCC TGCGCACCGA GATCCAGAAT
GCGATCGACG CCGCGAACGC GACAGTGTCG CATGCCGAGA GCATCAAGAA GTTCGCGATC
CTGCCGCAGG ACTTCACGGT GGAGACCGGC GAGCTCACCC CGAGCCTGAA GGTGCGCCGC
TCGCTGGTGC TCGACCGGTT CAGCCAGGCG GTGGAGGACA TCTACGCTAC CCCGCGCTAA
 
Protein sequence
MREYTSPPLV TIADDATLTD AVFRNAAAHP DSTLIQHKID DEFVGMTVRE FHDHVVATAR 
GFIARGVRPG DRVGLASRTR FEWTIVDYAA WLAGAVCVPI YETSSPGQIE WILRDAGVEV
LVVENDELAE RVAQIRDGVP ALREVLVIEH GALAGLAVDG AGIAPERLTA ARASVNADSL
ATIIYTSGTT GQPKGCEITH RALLFTAEAA IATLPELFAP GASTLLFLPL AHVFARMLQV
GVVQGAFTLA YTPDSRTLLP DLAKVRPTFL LSVPRVFEKV HAGARHKAHA EGRGWIFDAA
ENTAVAYSRA LDGGGPGLLL RLRHRLFAAL VYGKLQAALG GRARYAVSGG APLGERLGHF
FRGIGFTVLE GYGLTETSAP AAANRPGNVR MGTVGQPFPG VTIAIADDGE ILIRGPLLFR
GYRNNELATK EALDAEGFLH TGDLGDLDAD GFLRITGRKK ELLVTAGGKN IAPAPLEHII
QSHPLVSQAM LIGDRRPFVA ALVTLDPEAF DRWRSSAGKP AGATVADLID DAGLRTEIQN
AIDAANATVS HAESIKKFAI LPQDFTVETG ELTPSLKVRR SLVLDRFSQA VEDIYATPR
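
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before they can be used in downstream tools. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. It assumes GC content means the share of G and C bases over the full gene sequence; the helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "GTGCGCGAGT ACACCTCCCC GCCCCTGGTC ACCATCGCCG ACGATGCGAC CCTGACGGAC"
dna = clean_sequence(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Pasting the complete gene sequence into `clean_sequence` should give a string whose length matches the Gene Length field.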