Gene Franean1_3960 details


Gene Information

Locus tag: Franean1_3960
Symbol:
ID: 5672321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4741066
End bp: 4742973
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 74%
IMG OID: 641242839
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001508256
Protein GI: 158315748
COG category: [I] Lipid transport and metabolism
COG ID: [COG1022] Long-chain acyl-CoA synthetases (AMP-forming)
TIGRFAM ID:
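
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of this record or any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4741066, 4742973)
print(length)                     # 1908
print(protein_length_aa(length))  # 635
```

Both results agree with the Gene Length (1908 bp) and Protein Length (635 aa) fields.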


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.522834
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0456544
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGCAGT ACACCCTGGG AGCGGGCCGG GTCCTTCCTG ACGACCGCGG TCTTTATTCC 
CTCGTCGACG ACCGCGTCCG TACCGATCCG GACCGGGTGA TCCTCAGCCG GCCCACCGGG
ACGGACTGGC ACGACGTCAC CTTCCGCGAG CTCGACGCCC ACGTGCGCCG GGTCGCCGCG
GTGCTCCTCG GCCACGGGGT GGGCGTGGGC GACCGCGTCG GCATCGTCGG CCGCACCAGC
TACGAGTGGG TGGTCGCCGA CCTCGCGGTC CTCGCCCTCG GCGCGATCAC CGTGCCGATC
TTCCCGACCG CCTCGCCGGC GCAGATCACC CACATCGTGA CCGACTCCGG CATGGCGTGG
TGCTTCGTCG AGACCCCGGA GCACCACAGC GCGGTCGCCA CGGCCGGCGC CGGGACGCTG
CTCGCGCTGC CCTGGCAGCT GGCCGACCTG AACGGCTGGC AGGCCCCCGC CGACGACGCG
AGCCACGACG CGGGCCACGC GGCCGGCGGC GGTGACCTGG CCGCGGCGGA CGAGTTCGCC
GGGCGCCGGG ACGCCGTCCG GGCGGACTCG CTCGCGACGA TCGTCTACAC CAGCGGGACG
ACCGGCATGC CCAAGGGCTG CATTCTCACC CACGGCAACC TGTTCGCCTC CAGCGCCAAC
ACCGTCGAGC ACACCGGCGA GCTGTTCCGG GTGTGGCGGC CCGCCGCGGG CACCGGAGCG
GCCGAGGGGT CGGCGGGATC AGAGGCGTCG GTCGAGCAGG CGTCGACCCT GCTCTGCCTG
CCGCTGGCAC ACGTCTTCGG CCGGACGATC CTCGTCGCCT GCATCTATGC CGGAACGCGG
ACCGGCCTGC TGGCGGCGGT CCCGGACATC CTGCCCGCGA TGGCGACCTT CCGGCCGACC
GTGCTCGCGC TCGTCCCCTA CGCCCTGGAG AAGATCCGCA AGGGGCTGCG CGGCGTCGTC
GACACCGACA CCGAGCTCGC GGCGGTGGCC GCCGGGCTGG CCGGCGACCC GGCCGACGGC
CCAGCCACCA GCCCGGCCGC CGAGCGGCGG CCCGCCCCGG CTGCGCTGGC CCGGATCAAC
GGCGTGTTCG GTGGCCGGCT GACCCACGTG ATCAGCGGCG GCGCGTCGCT GGACGCCACC
ACGGCGGCCT TCTACCGCGG GGTCGGCGTG CGGATCCTGA ACTGCTACGG CCTCACCGAG
GCCGCGACGG CGGTGACCGT CAACCAACCC GGCACCAACC GCATCGGCAC GGTCGGCCAG
CCGATCCCCG GCACCACCGT GGCGATCTCC CCGGACGGCG AGGTGCTGGT CGCCGGCCCC
AACGTCTCCC CCGGCTACTG GCGGGCCGGC CAGGCCCCGT GGATCGAGCC CGCCGGCACC
CAGGCCGAGC CCGGCGCCGG CCCGTCCCGC TGGCTGCACA CCGGCGACCT GGGCCACCTC
GACGCCGACG GCTTCCTCGT CATCACCGGC CGCCGCAAGG AAATCCTGGT CACCAGCGGT
GGCAAGAACG TCACGCCGAC CCTGCTCGAG GACAGGATGC GGCTGCACCC GCTGGTCGCC
GACTGCATGG TCGTCGGTGA GGCCCGCCCG TACGTCGCCG CCCTGGTGAC GACCGACCAG
GGTGCGCTCA ACGCCCTGGC CGCGGCGCAC GGCATCGACC TCGCCGCCTC CGGCTGGTGG
GAGCACCCCG CCCTGCTCGA CCGGGTGCAG GAGGCGGTCG ACGACGCCAA CGGCCTGGTC
TCCCGGGCCG AGTCCATCCG CCGGTTCCGA ATCCTCCCCA CCCAGCTCAC CATCGACGCC
GGTCACCTGA CGCCCTCGAT GAAGCTGCGC CGCGCACCGA TCGAGACGGC CTTCGCCACC
GAGATCGAGC AGCTCTACTC CCCGGCCGCC GCCCCGGCAC CGGCCTGA
 
Protein sequence
MQQYTLGAGR VLPDDRGLYS LVDDRVRTDP DRVILSRPTG TDWHDVTFRE LDAHVRRVAA 
VLLGHGVGVG DRVGIVGRTS YEWVVADLAV LALGAITVPI FPTASPAQIT HIVTDSGMAW
CFVETPEHHS AVATAGAGTL LALPWQLADL NGWQAPADDA SHDAGHAAGG GDLAAADEFA
GRRDAVRADS LATIVYTSGT TGMPKGCILT HGNLFASSAN TVEHTGELFR VWRPAAGTGA
AEGSAGSEAS VEQASTLLCL PLAHVFGRTI LVACIYAGTR TGLLAAVPDI LPAMATFRPT
VLALVPYALE KIRKGLRGVV DTDTELAAVA AGLAGDPADG PATSPAAERR PAPAALARIN
GVFGGRLTHV ISGGASLDAT TAAFYRGVGV RILNCYGLTE AATAVTVNQP GTNRIGTVGQ
PIPGTTVAIS PDGEVLVAGP NVSPGYWRAG QAPWIEPAGT QAEPGAGPSR WLHTGDLGHL
DADGFLVITG RRKEILVTSG GKNVTPTLLE DRMRLHPLVA DCMVVGEARP YVAALVTTDQ
GALNALAAAH GIDLAASGWW EHPALLDRVQ EAVDDANGLV SRAESIRRFR ILPTQLTIDA
GHLTPSMKLR RAPIETAFAT EIEQLYSPAA APAPA
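
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of that translation, not the pipeline used to produce this record. `translate_cds` is a hypothetical helper, and the codon table is built by hand from the standard amino-acid assignments that table 11 shares with the standard code.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of translation table 11; all are read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace used to group the bases."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"          # initiator codon is always methionine
    if protein and protein[-1] == "*":
        protein.pop()             # drop the terminal stop codon
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCAGCAGT ACACCCTGGG AGCGGGCCGG"))  # MQQYTLGAGR
```

The output matches the first ten residues of the protein sequence listed above.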