Gene Franean1_3556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3556 
Symbol 
ID5671925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4219798 
End bp4221447 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content77% 
IMG OID641242442 
ProductABC transporter related 
Protein accessionYP_001507862 
Protein GI158315354 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA GAACGACCGG CGCACCGGAC GTCGGCCACA CCCCCGAGGG CTCCGAGGGG 
CCCGCCGCGG ACGTCGTCGT GACCGATCTG AGCATCGCCG CGGCCGGCGG GCGCGCGGTC
CTCCACCGCG TCTCCTGTGA CCTGCCCGCC GGTGGGACCC TTGCCGTCGT CGGCACCTCC
GGCGCGGGCA AGACGACGTT CGCGCTCGCG TTGGTGGGCC ATCTCGGGCC GGGCCTGACC
CGGACCTCGG GCGGCGTGAC GATCGGCGGG GTCGACGTGT TCACCCGGCG CGCATCCCGC
GCGCGGGAGC TGCGCCGCCA CCGGATCCGC TACCTGCCCC AGGATCCGGC GGCCTCGCTC
ACCCCGACCA TGCGCGTCTC AGCGCTCCTG TCGGAGATGA TCCGCCTCGT CGGCGGCCGT
CGGGCGGACG CGCGGGCGAG GGCCGCCGCG GCGTTGCGCG CGGTCGGGCT GCCCGACGAT
CCGACCTTTC TCGCGCGGTA TCCCCACCAG CTCTCCGGCG GGCAGCGTCA GCGGCTGCTG
CTCGCGCTGG CGCTGACCGG CGAGCCGGAC GTGCTCGTCC TCGACGAGCC GACCGCGAAC
GTGGACCCCG ACCAGGCGGC CGCCCTGCTG GCCCTGATCG AGCAGCGCCG GGCGGGGCGC
TCGTTCTCGC TCGTGCTGGT CAGCCATGAC CTGGCGGCCG TTGCCGCGCT GCCGGGCGCG
CCGGAGCTCG TCGTGCTGGA CGGCGGGCGG CTCGTGGAGC GGGGAGCGCC CCGCGACGTC
CTCGACCGGC CGCGCACCGG CCCGGCCCGG GCGCTGAGCA CCGCCAGCCG GCGGCTGAGC
CACCCACCGG AGCAGGCGAC GGCGCAGCCA CCGGCACAGG CGTCGCCGCG TGCGGCCGGG
CCCGAGCCCG GGCCCGCCGC TGTTCCCTCC CTCTCCCCCG GGCCGGACGC GGTGACGCTG
CGTGTCGCCG GGCTGCGGGT CTCCACCGGG ACGGCACGCC GCCGGGCCGA GGTGCTGCGC
GGTGTCGACC TGACCGTGCG GCGGGGCGAA TGCGTCGGCG TCGTCGGTGT CTCCGGCAGC
GGGAAGACCA CGCTGGCCCG TGCCGTCATC GGCCTGCACC CGTGGGACGG CGGGACCGTG
AAGCTCGGCG GAGTGCCGCT GGCACCCGCG GCGACGGACC GCCCGCCGCC GCAGCGCCGC
CGGATCGGCT ACGTCTTCCA GGACCCGTAC ACGTCTCTCA ACCCGCGCCG GCCGGTCGGT
GAGGCGGTGA CCCGCGCCTA CGCCCTCGCC GCGGGCGACG CTCGGCAGGG CGGACTGGGC
GAGGAGGTCG CCGCGCTGCT CGCGGATCTG GGGCTCGACC CCGAGCTGGC GGCCCGCCGC
CCGGAGCGGC TCTCCGGCGG CCAGCGCCAG CGGTTCGCCC TGGCCCGGGC CCTCGCCACG
GCTCCGGACC TGCTGATCTG CGACGAGGTG ACGTCGGCGC TGGATCCCGT GTCAGCCAGC
GCGATCTGCG GGCTGGTCCG TGGCCTGGTC ACCGAGCGCG GCCTGGCCGC CGTGTTCATC
AGCCACGACC GGGGTGCCGT CGGCGCGGTG GCCGACCAGG TCCGGGAGCT GCGGGACGGG
CTGCTCGCCG CCCCACCTCC CGGGCCCTGA
 
Protein sequence
MTTRTTGAPD VGHTPEGSEG PAADVVVTDL SIAAAGGRAV LHRVSCDLPA GGTLAVVGTS 
GAGKTTFALA LVGHLGPGLT RTSGGVTIGG VDVFTRRASR ARELRRHRIR YLPQDPAASL
TPTMRVSALL SEMIRLVGGR RADARARAAA ALRAVGLPDD PTFLARYPHQ LSGGQRQRLL
LALALTGEPD VLVLDEPTAN VDPDQAAALL ALIEQRRAGR SFSLVLVSHD LAAVAALPGA
PELVVLDGGR LVERGAPRDV LDRPRTGPAR ALSTASRRLS HPPEQATAQP PAQASPRAAG
PEPGPAAVPS LSPGPDAVTL RVAGLRVSTG TARRRAEVLR GVDLTVRRGE CVGVVGVSGS
GKTTLARAVI GLHPWDGGTV KLGGVPLAPA ATDRPPPQRR RIGYVFQDPY TSLNPRRPVG
EAVTRAYALA AGDARQGGLG EEVAALLADL GLDPELAARR PERLSGGQRQ RFALARALAT
APDLLICDEV TSALDPVSAS AICGLVRGLV TERGLAAVFI SHDRGAVGAV ADQVRELRDG
LLAAPPPGP