Gene Franean1_3476 details


Gene Information

Locus tag: Franean1_3476
Symbol:
ID: 5671847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4132940
End bp: 4134724
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 77%
IMG OID: 641242364
Product: ABC transporter related
Protein accession: YP_001507784
Protein GI: 158315276
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR02323] phosphonate C-P lyase system protein PhnK
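
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the CDS is complete with a single terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4132940, 4134724)
print(length_bp)                  # 1785
print(protein_length(length_bp))  # 594
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.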


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0289183
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0200783
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCCA TCACCGGAGC CGACGACGCT CCCGAACTGC TCTCGCCCCC GGCGGCGGCG 
CCGGAGTCCC CGCCGGAGTC CCCGCAGCCG CCGGAGCCGA CCGCGCTGGT CGAGGTGGAC
GGCCTCGACG TGCGGTTCGG CTCCGGCCCC GGCGCGGTGC ACGCCGTACG CGACGTCTCG
CTCACCCTGA CCGCGGGCCG CTGTCTGGCC CTCGTCGGGG AGTCCGGGTC CGGCAAGAGC
GCGCTGGCCC GCGCGCTGCT GGGCCTGGCC GGGCCGACGG CCACCGTCAC CGCCCGCCGG
CTGCGCATCG ACGACGCGGA CGCCCTCACC TTCCGGCCGC GGGACTGGCT GAAGGTCCGC
GGCCGGCGCA TCGGACTGGT GTCCCAGGAC GCGCTGGTCG CGCTCGACCC GCTCCGCCCG
ATCGGCCGCG AGGTCGCCGA GCCGATCCTG GCGCACCGGC TGCTCCCCCG GCGAGAGGTC
GAACCGGCCG TGCACGCGCT GCTGGAGCGG GTCGGCATCC CCGACCCGGC CGAGCGGGCG
CGCAGCTACG TCCACCAGCT CTCCGGCGGG CTGCGCCAGC GGGCGCTGAT CGCCTCCGCG
CTGGCCGCCG GACCCGGCGC GCTCATCGCG GACGAGCCGA CCACCGCGCT CGACGCCTCG
GTGCAGGCCC GCGTCCTCGG CCTGCTCGGG CAGCTCAAGG CGGACGGCAC CGGGCTGCTG
CTGATCAGTC ATGACCTGGC GGTCGTCGAG GCGCTGGCCG ACGAGGTCGC GGTGATGCGC
GAGGGGGTCG TCGTCGAGGC CGGGCCGGCC GCCGAGGTCC TGGCCCGCCC GCGGCATCCC
TACACGATCG CGCTGCTGGA CGCCGTCCCC GGCCGCCGCG GACGTCCGAA CGGCGTCGCC
CCGGCGACCG ACACCGCCGC GACCGCCGAC GCGACCTCGA CCGCCGACGC AGCGGGGGCG
GGCGACGTCC CGGGCGCCGA CGAGGCGGCG CCGCTGCTCG CGGTCGCCGG TGCCACCAAG
CACTTCCGCG GGCCGCGGGG CAGCCGGCGC ACGGCCGTCG ACGACGTCTC GTTCACCCTG
CGGGCCGGCG AGACGCTCGG GCTGGTGGGC GAGTCGGGCT CCGGGAAGTC CACCCTGGCC
GGCCTGGTGC TCGGCCTCGT CGCCGCCGAC GGCGGCGAGA TCCGGCTGGC CGGGCAGCCG
TGGAGCGGCA TGCCCGAGCG GGAGCGGCGG GAACGGCGCC ACCTGCTCCA GCTCGTCCCG
CAGGATCCAC TGAGCGCCTT CGACCCGCGC TGGACGGTCG CCCGCATCAT CGGCGAGGGG
CTGGAGGCGG CCGGGGTGGA GCGCCGGGAG CGACGCGCGC AGGCCCTGAC CCTGCTCGAA
CAGGTCGGGC TGTCCGACAT CCACCTGGAC CGCCGCCCGC TGGCGCTGTC CGGGGGGCAG
CGGCAGCGGG TGGCGATCGC CCGCGCGCTG GCGACCCGGC CGCGGCTGCT CGTGTGCGAC
GAGCCCGTCT CCGCCCTCGA CGTCTCGGTG CAGGCTCAGG TGCTGGCCCT GTTCCGGGAG
CTGAGCGACT CCCTCGGGCT GGCCACCCTG TTCATCTCGC ACGACCTGGC CGTGGTGCGG
GAGGTCTGCG CCCGGGTGCT GGTGATGAAG GACGGCCGGA TCGTCGAGAC CGGACCGGTG
GAGCAGGTGT TCGCCGACCC GCGGCACCGG TACACCCAGG AGCTGCTCTC CGCCGTCCCC
GGCAAGGCGG GGCGGGCCTA CCGCGCGCGT TTCGGAACCG GCTGA
 
Protein sequence
MTAITGADDA PELLSPPAAA PESPPESPQP PEPTALVEVD GLDVRFGSGP GAVHAVRDVS 
LTLTAGRCLA LVGESGSGKS ALARALLGLA GPTATVTARR LRIDDADALT FRPRDWLKVR
GRRIGLVSQD ALVALDPLRP IGREVAEPIL AHRLLPRREV EPAVHALLER VGIPDPAERA
RSYVHQLSGG LRQRALIASA LAAGPGALIA DEPTTALDAS VQARVLGLLG QLKADGTGLL
LISHDLAVVE ALADEVAVMR EGVVVEAGPA AEVLARPRHP YTIALLDAVP GRRGRPNGVA
PATDTAATAD ATSTADAAGA GDVPGADEAA PLLAVAGATK HFRGPRGSRR TAVDDVSFTL
RAGETLGLVG ESGSGKSTLA GLVLGLVAAD GGEIRLAGQP WSGMPERERR ERRHLLQLVP
QDPLSAFDPR WTVARIIGEG LEAAGVERRE RRAQALTLLE QVGLSDIHLD RRPLALSGGQ
RQRVAIARAL ATRPRLLVCD EPVSALDVSV QAQVLALFRE LSDSLGLATL FISHDLAVVR
EVCARVLVMK DGRIVETGPV EQVFADPRHR YTQELLSAVP GKAGRAYRAR FGTG
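
Both sequences are displayed in blocks of ten characters separated by spaces, so the display whitespace has to be removed before any analysis. The sketch below assumes the sequence text has been pasted into a Python string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last display lines of the gene sequence are used for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, then upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGACGGCCA TCACCGGAGC CGACGACGCT CCCGAACTGC TCTCGCCCCC GGCGGCGGCG"
last_line = "GGCAAGGCGG GGCGGGCCTA CCGCGCGCGT TTCGGAACCG GCTGA"

start = clean_sequence(first_line)
end = clean_sequence(last_line)

print(len(start))               # 60
print(start.startswith("ATG"))  # True
print(end.endswith("TGA"))      # True
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.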