Gene Francci3_2165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2165 
Symbol 
ID3906765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2535260 
End bp2536795 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content68% 
IMG OID637879498 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_481264 
Protein GI86740864 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.708059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGTGC CTTTCGGCGC ACGTGATTTC CTTGATCGAG CGGTCACGGT GTATGGCAGA 
CGTGTCGGTG TCGTGGACGA ACCGGACCAG CCCGCCCAGC CGTGGCCGGA CCTGACCTAC
GCCCGCCTGG GTGAGACCGC CCGTGCGCAG GCGGCGGGCC TGGACGCCTT AGGCGTGGGC
CACGGTGAGC GCGTCGCGAT CGTGTCGCAT AACAGTGCCC GCCTGCTCGC GTCGTTCTTC
GGTGTCAGCG GGTACGGCCG GGTGCTCGTC CCGGTGAACT TCCGTCTCTC GCCGGAGGAG
GTCAGCTACA TCGTCGAGCA CTCCGGCTCC GAGGTGCTGC TCATCGACCC CGAGCTGGAG
GAGAAGCTCT CCGGCGTGAC CGCGAAGCAG AAGTTCGTGC TGGGCGCCGA GAGTGACGCC
GAGCTCTACC GCTTCGACGC CGAGCCCACG CCGTGGGAGC CGGATGAGAA CGCCACCGCC
ACGGTCAACT ACACCAGCGG TACGACCGCG CGCCCCAAGG GCGTGCAGAT CACCCACCGC
AACATCTGGG TCAACGCGGT CACCTTCGCG CTGCACGCCG GTGTCACCGA TCGCGACGTC
TACCTGCACA CGCTGCCGAT GTTCCACGCC AATGGCTGGG GCATGCCGTT CGGCATGACC
GGTCTCGGCG TCCAGCAGGT GGTGCTGCGC AAGATCGACG GCCCGGAGAT CCTGCGCCGC
GTGGAGCAGC ACGGCGTCAC CGTGATGTGC GCCGCGCCGG CTGTGGTCAA CGCCGTGCTC
GACGCCGTCC GGGACTGGGA CGGCGAGGTC CCCGGCCGCG ACCGCGTGCG GGTCATCTGC
GCGGGTGCTC CGCCGCCCAC CAAGACGATC CAACGGGTCG AGGAGGAGCT CGGCTGGGAG
TTCATCCAGA TTTACGGCCT CACCGAGACC TCGCCGCTGC TCACGATCAA CCGTTCCCGC
GTGGAGTGGG ACGACCTGCC GCCCGAGGAT CGCGCGGGCA AGCTCGTCCG TGCGGGTGCG
CCCGCTCTGG GTGTCACGCT CAAGCTCTCA GACTCCGGTG AGGTGCTGGC TCGCTCCAAC
GTGATCTTGG CGGGCTACTG GGAGCGGCCG CAGGAGTCGG CCGGGGCCCT GGCGGGCGGC
TGGTTCCACA CCGGTGACGG CGGCGTGATC GACGACGAGG GCTACCTGAC GATCAGTGAC
CGCAAGAAGG ATGTGATCAT TACCGGCGGT GAGAACGTCT CGTCGATCCA GGTCGAGGAC
TGCCTGTTCG GCCACCCGGC AGTGGCCGAG GTCGCGGTGA TCGGCGTGCC CGACGAGAAG
TGGGGTGAGG CGATCAAAGC GCTGGTCGTG CTCGCCGAGG GCCGGACGGC GACCGAGGCG
GAGCTGATCA AGCACTGCAA GGAGCGGCTC GCCTCCTACA AGGCGCCCAC CTCGGTGGAG
TTCCGCGACC AGCTTGCCCG CACCGCCACC GGCAAGCTGC AGAAGTTCAA GCTGCGCGCC
CCGTACTGGG AGGGCCACGA GCGCCAGGTC GACTGA
 
Protein sequence
MLVPFGARDF LDRAVTVYGR RVGVVDEPDQ PAQPWPDLTY ARLGETARAQ AAGLDALGVG 
HGERVAIVSH NSARLLASFF GVSGYGRVLV PVNFRLSPEE VSYIVEHSGS EVLLIDPELE
EKLSGVTAKQ KFVLGAESDA ELYRFDAEPT PWEPDENATA TVNYTSGTTA RPKGVQITHR
NIWVNAVTFA LHAGVTDRDV YLHTLPMFHA NGWGMPFGMT GLGVQQVVLR KIDGPEILRR
VEQHGVTVMC AAPAVVNAVL DAVRDWDGEV PGRDRVRVIC AGAPPPTKTI QRVEEELGWE
FIQIYGLTET SPLLTINRSR VEWDDLPPED RAGKLVRAGA PALGVTLKLS DSGEVLARSN
VILAGYWERP QESAGALAGG WFHTGDGGVI DDEGYLTISD RKKDVIITGG ENVSSIQVED
CLFGHPAVAE VAVIGVPDEK WGEAIKALVV LAEGRTATEA ELIKHCKERL ASYKAPTSVE
FRDQLARTAT GKLQKFKLRA PYWEGHERQV D