Gene Franean1_6820 details

Gene Information

Locus tag: Franean1_6820
Symbol:
ID: 5675133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8308341
End bp: 8310302
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 73%
IMG OID: 641245669
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001511060
Protein GI: 158318552
COG category: [I] Lipid transport and metabolism
COG ID: [COG1022] Long-chain acyl-CoA synthetases (AMP-forming)
TIGRFAM ID:
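
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which can be used as a sanity check on the record. The sketch below assumes inclusive, 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(8308341, 8310302)
print(length)                     # 1962
print(protein_length_aa(length))  # 653
```

Both values agree with the Gene Length and Protein Length fields listed in the record.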


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCTG ACACGACGAC ATCCTTCGAG GTGGTCGGCT CGGCCGACGA GGCGGCGGAC 
ATCCGGGCCG CGGTCGCCGG CCTGACCGTC CCCATGCTGT TGGCCCGCCA GTGCCGCTCC
CGCCCGGACG CGGTGAGCAT CCACTGGATG CGCGACGGCG AGTGGCAGGC GTGGACGTGG
CGCCAGTACG GCGCGCAGGT CGCGCGGATG GCCGGCGCGC TGGAGCGTCT CGGGTTCGGC
CGCGGTGACC GGGCGCTGCT GATGACGTGC CCGCGGCCGG AGTTCCACGT CATCGACTCC
GCCGTGCTCC TGCTTGGCGG CTGCCCGATC TCGATCTACA ACTCGTCGCC GGCGGAGCGG
GTGCGTTACC TCGCCGCGCA CTGCCGGGCC AGCCTCGTCG TCGTCGAGGG ACGCGAGCTG
CTTGCCCGGG TGCTGGCCGT GCGCGCCGAG CTGCCCGACC TGCGGCACGT CGTCGTCATC
GACCCGGCTC CCACCGGCAG CGGCGGTACC GGCAGTGAGG CCGCCGGCGG CGAGAACCTC
GGCGGCGGGG CCGCCGGTGA CGTGCCGCCG GGTGTGCTCC GCTGGGACGA CCTGCTCGCG
GCCGACCCGG TCGACCTCGA GGCGCGGGCC GCGACCGCCG GCCCCGACGA CCCGTGCACC
GTGATCTACA CCTCGGGGAC GACCGGCGTC CCGAAGGGCG TCATGCTGGA TCACAGGGCG
GTGATCTGGC AGTGCGAGAG CTATCTGCGC CGGCTCGACC GTGACCTGAC CGGTGCCCGC
TGGGTCAGCT ACCTACCGGT GGCACACATC GCGACCCGGT TCTACGCGCA GTACTTCCAC
GTCTACGCGG GCCTGGAGGA CATCACCTGC CCGAACCCGG CGGACGTCGA ATCGGTGCTG
ATCGACCGGC CGCCGCACCT CTTCTTCGCG CCGCCGCGGC TGTGGGAGAA ATACCTGATC
TCCCTGCGGG AGTGGATCGG CGCCATGGAC GACCCGGCCC GCGCGACGCG GCTCACCCGC
GCCATGCGAG TCGGCGAGGA GGCGGCGCTG CTCGAGCTCA CCGGCGGGGA CGTGCCACCC
CGGCTCGCCC GGGAACGGGA GGAGCTGCGG CCCGCCACCC GGGAGCTGCG CACCCGCCTC
GGGCTCGGCA ACCTGCTGAC CGGGGCGATC GGGGCCGCGC CCGCGAACCC CCAGCTCACC
GCGGCGTGGA TCGGCCTCGG CGTGCCGATG TTCGAGGGCT ACGGGCTGAG CGAGTCGACC
GGCATGCTCA CGGTCGACCC GTTCGCCTAC CGCCTCGGCA GCGTCGGCCG CCCGATGCCC
GGCGTCGAGC TGCGCGTCGC GCCGGACGGC GAGCTGCACT TCCGCGCCGG CAGCGCGTTC
CGGAGCTACC TGGACGACCC CGAGCAGACC ACCGCCGCCA TCGATGCGGA GGGCTGGGTG
CGCACCGGTG ACCTGGCCAC CATCGACGAC GGCTACGTCC GGCTGCGGGG ACGTAAGAAG
GAGCTCATCA TCACCGCCGG AGGCGAGAAC GTCTCACCTG TGGCGGTCGA GTTCGCGCTC
GCGGCGCAGC TGCTGGTGGG GCAGGTGTGC GTGCTCGGCG ACGGCCAGCC GGCGCTGGGT
GCTCTCGTCG TCCTGGACCC CCAGGCCGCC GCGGTCTGGG CCGGCGTGAA CGGGATCCCG
TTCACGCACG TCGACGAGCT CGCGGCCGAT CCGCGGGTGC TGGCCGAGGT CGGGCGGCAG
ATCGCCCGGG CCAACGACCT GCTCGTGCGC CAGGAGAAGG TCCGGCGGTT TCACGTGCTG
CGCCACGAGT GGCCGCTCGA CTCCGACGAG CTCACGCCCA CCGCGAAACT CCGCCGCGAT
CAGATCGCGG CGAAATACCA GGCCGAGATC GCGGCGATGT TCGCGGCGCC CGCCAATGTC
GAGATCATGC CGCGTGCGGT GGCCGCGAAC CATTCCTCGT GA
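
The GC content field reported in the record can be recomputed from the gene sequence once the block layout (groups of 10 bases separated by spaces and line breaks) is removed. This is a minimal sketch with hypothetical helper names. It is applied here only to the first two blocks of the sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first two blocks of the gene sequence, for illustration.
fragment = clean_sequence("ATGGCGGCTG ACACGACGAC")
print(len(fragment), gc_percent(fragment))
```

Passing the full gene sequence through clean_sequence and gc_percent gives the whole-gene value to compare against the GC content field.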
 
Protein sequence
MAADTTTSFE VVGSADEAAD IRAAVAGLTV PMLLARQCRS RPDAVSIHWM RDGEWQAWTW 
RQYGAQVARM AGALERLGFG RGDRALLMTC PRPEFHVIDS AVLLLGGCPI SIYNSSPAER
VRYLAAHCRA SLVVVEGREL LARVLAVRAE LPDLRHVVVI DPAPTGSGGT GSEAAGGENL
GGGAAGDVPP GVLRWDDLLA ADPVDLEARA ATAGPDDPCT VIYTSGTTGV PKGVMLDHRA
VIWQCESYLR RLDRDLTGAR WVSYLPVAHI ATRFYAQYFH VYAGLEDITC PNPADVESVL
IDRPPHLFFA PPRLWEKYLI SLREWIGAMD DPARATRLTR AMRVGEEAAL LELTGGDVPP
RLAREREELR PATRELRTRL GLGNLLTGAI GAAPANPQLT AAWIGLGVPM FEGYGLSEST
GMLTVDPFAY RLGSVGRPMP GVELRVAPDG ELHFRAGSAF RSYLDDPEQT TAAIDAEGWV
RTGDLATIDD GYVRLRGRKK ELIITAGGEN VSPVAVEFAL AAQLLVGQVC VLGDGQPALG
ALVVLDPQAA AVWAGVNGIP FTHVDELAAD PRVLAEVGRQ IARANDLLVR QEKVRRFHVL
RHEWPLDSDE LTPTAKLRRD QIAAKYQAEI AAMFAAPANV EIMPRAVAAN HSS
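
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G). Translation table 11 uses the same amino acid assignments as the standard code and differs in which start codons are permitted, which this sketch does not model. The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring layout whitespace."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence.
print(translate("ATGGCGGCTG ACACGACGAC ATCCTTCGAG"))  # MAADTTTSFE
# Last four codons of the gene sequence, including the stop codon.
print(translate("CATTCCTCGT GA"))                      # HSS*
```

The two outputs match the first ten and the last three residues of the protein sequence above, followed by the stop codon.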