Gene Franean1_5169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5169 
SymbolligC 
ID5673503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6201061 
End bp6202287 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content73% 
IMG OID641244023 
ProductATP-dependent DNA ligase 
Protein accessionYP_001509433 
Protein GI158316925 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00454515 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAGG ATCGGCGGGT GGACCTGCCT GTGACACCGC CGGTGAAGCC GATGCTCGCC 
CGCGCGGCGC CGCAGATCCC CCCGGACATG CTGTATGAGC CGAAGTGGGA CGGCTTCCGC
GCCCTGGTCT TCCGTGACGG GGCCGAGCTG GAGATCACCT CGCGCAACAC CCGGCCGATG
ACCCGCTACT TCCCCGAGCT GGTCGAGGTG CTGCTCGCGG CACTGCCCGA CCGCTGCGTG
CTTGACGGCG AGATCGTCGT CGTCGGCCCG AACGGACTGG ACTTCGAGGA GCTGTCGCAA
CGGGTGCATC CGGCGACCAG CCGGGTGGCG AAGCTCGCGC TGGAGACCCC GGTCTCGTTC
GTCGCGTTCG ACCTGCTGGC ACTCGGTGAC GAGGCGTTCA CGGACCAGCC GTTCGCCCGG
CGGCGCGCCG TGCTCGAGGA GGTTCTCGCC GGCCACGCCG GCCCGGCTGC GCCCGGAACG
GCACCGGCCC GGCGGATCCC GAGCGGGGTC TACCTCACGC CCTCGACCGG CGAGCTCGAC
ATGGCCCGGC AGTGGTTCGA GCTCTACGAA GGCGCGGGGC TCGACGGGCT GGTCGCCAAG
CCGCCGGACG GGGCGTACCA GCCGGACAAG CGCGCAATGT TCAAGATCAA ACATGACCGC
ACCGCCGACT GCGTCGTGGC CGGCTACCGG CCGCACAAGA ACGATCCGGA GGCGGTCGGG
TCGCTGCTGC TCGGGCTCTA CGCCGACCCC GCGGACGAGG CCGACCCCGA GAACGCGACG
GACCCGGCTC GGGAAAGCCC GCTGCTGTCC GTCGGGGTCA CCTCGGCCTT CCCGATGGCG
CGCCGGCGGG AGCTCGTCCG CGAGCTGGCC CATCTCGTGG TGCCGATCGA CTCCCACCCC
TGGGCCCGCC AGGGCCCGGA GAACGCCGCG CAGCCGGGCG GCGACGCGGG CGAGGAACCG
GCAGCGGCCG CGGGGCAGCC GGCGCGCACG CCCTGGGACG TCGGGGAGAG CCGGTGGGCC
CGTGGCCGTG ACCTCTCGTT CGTCCCGCTG CGGCCCGAGC TGGTCGTCGA GGTGCGCTAC
GACCACATGG AGGGACCGCG CTTCCGGCAC ACCACGCAGT TCGTCCGCTT CCGGCCCGAC
CGTGACCCCG GCGGATGCAC CTACGCCCAG CTCGAGCGTC CGGTGCGGTT CGACATCGCC
GACGTCCTGC GCATCCCGCC GGACTGA
 
Protein sequence
MREDRRVDLP VTPPVKPMLA RAAPQIPPDM LYEPKWDGFR ALVFRDGAEL EITSRNTRPM 
TRYFPELVEV LLAALPDRCV LDGEIVVVGP NGLDFEELSQ RVHPATSRVA KLALETPVSF
VAFDLLALGD EAFTDQPFAR RRAVLEEVLA GHAGPAAPGT APARRIPSGV YLTPSTGELD
MARQWFELYE GAGLDGLVAK PPDGAYQPDK RAMFKIKHDR TADCVVAGYR PHKNDPEAVG
SLLLGLYADP ADEADPENAT DPARESPLLS VGVTSAFPMA RRRELVRELA HLVVPIDSHP
WARQGPENAA QPGGDAGEEP AAAAGQPART PWDVGESRWA RGRDLSFVPL RPELVVEVRY
DHMEGPRFRH TTQFVRFRPD RDPGGCTYAQ LERPVRFDIA DVLRIPPD