Gene Franean1_3763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3763 
Symbol 
ID5672128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4459408 
End bp4462362 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content74% 
IMG OID641242644 
ProductABC transporter related 
Protein accessionYP_001508064 
Protein GI158315556 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.849502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGAGG GCTCCCTCGC GCACCCGCGG CCGGCCCTGA TCGCCACCCT CGCCGGCATC 
GTCAACGGGA CCACCATGAT CCTCGGGGCC GCCGCCATCG GCTGGGCCAC CGACCATCTG
ATCGTCCCGG CCCTCGCCGG CGGCCACGTC GCCCGCGCCA CCTGGTGGAT CGCGGTCGGC
GCGATCCTCG GCGTCTCCAC GGTGCGCTGG ATGACCATCG TCATCCGCGG CATCGCGACC
GGGTACGTGC AGCACGGCTC GCAGGCGCGG GTCCGGCGGT CCGTCGTCGG CCGGTATCTC
GAGCTCGACC TGGCCTGGCA CCGTCGGCAC CCGCCCGGTC GCCTGCTGTC CACCGCGGTG
TCCGACGTGG ACGCGCTGTG GTTCCCGATG GTCTTCTACT ACTTCGCGCT CGGGATGATC
GTCATGCTGG TCGTCGCGAT CGTCCAGCTC TTCGGGCACG ACACCGCCCT CGGGCTGGTC
GGGGTCGGCC TCGTCGGCTC CGTCCTCGGG GTGAACCTGC TCTACCAGCG CCTGCTCAGC
CCGCGCGCCC GGGCGGTGCA GGACAGCCGC GGCGAGTTCG GCGCGCTCGC CCTGGAGAGC
ATCGAGGGCG GCCAGGTCGT CCGGACCCTC GGCATCGCCG ACCGCGAGCG GGCCCGGGTC
GGTGCCGCGG CGCTCCGGCT GCGCGCGGCC ACCACCGCGG CCGGCGACCT CAGCTCGGTG
TTCGACCCGC TCCTGGAGGT GCTGCCGACC GCCGCGGTCA TGGCCGTCCT CGCCGTCGGC
TCGCGCCGGG TCGAGACCGG CGACCTCAGC GTCGGCGTCC TCGTCGAGGT CGTCTACCTG
CTGCTGACCA TCTCCATCCC ACTCAACGTG ATCAGCCGTT TCCTGGGGAT GCTGCCGGTC
TCGGCGGCCG GCCGCACCCG CGTCGCCGCC GTGCTCGACG CCGCCGAGAC CACCGCCCAC
GGAGACCGCG CCCTGCCCAG CGGCGCCGCT CCCGGCCCGC GGGCGCCGGC TCCGCGGGCG
CCGGGCGTCG GGCTCGTCCG CGGCGGGACG AGCCTTCTCA CGGACGTCGA CATCGAGGTG
CGCCCCGGCG AGATCGTCGC CATCGTCGGG CCGACCGGCT CGGGCAAGAC CACCCTGATC
GAGCTCCTCA GCCGCCAGGT CGACCCCACC GACGGCGTCG TCGAGATCGG CGGGGTTCGC
GCCACGGACC TCGCCCGCGG GGAGATCTCG TCCCAGCTGG CCGTTGTCGG GCAGACCTCG
TGGCTGTTCG GCGGCAGCGT CCACGCCAAC CTTCAGCTCG ACGGCCATCC CCGCGAACGG
CGTCCCTACA CCGCCGGTGA GATCTGGCGG GCGCTGGCCG CCGCCGGCGC CGATGACGTC
GTCCGTGACC TGCCCAACGG CCTCGACACC CGGGTCGGCG AGCGCGGCGC CCGGCTCTCC
GGCGGCCAGC GGCAGCGGCT GTGCCTGGCC CGCGCGCTGT TGCGCGAGCC CGGCGTGCTG
CTGCTGGACG ACGCCACCTC GGCCCTCGAT CGGCGCACGG AGGCCGCGCT CGCCGAGCTG
CGCGCGGCCG GCGTCGCGCG CATCCACGAG ATGGCCGCGG AGACGTTCGC GCGCACCGGC
AGCGCCGACC TCGTCAGCCG GCTCACCGGC GACGTCGACG CCGTCACGAC CTTCGTGCAG
AGCGGCGGCG TCATGCTGCT CGTCAACGTC ACCCAGATGA TCATCGCCGG CGTGCTGATC
GCGGTGTACT CCTGGCAGCT GGCGGTGCCC GTGCTGGCGA CGGCGGTGCT GTTGTTCGTC
GCGATGCGTC GCGTGCAGGC GCTGGTGGCG CGCCGGTTCA CCGTGGTCCG GGAGAGCGTG
TCAGCGCTGC AGTCCACCGT GGGTGAGGCC GTCACCGGCA TCAGCGTCAT CCGCTCGACC
GGCACCGAGG CACGCAGCCG CGCCATGGTC GAGGACGCCG TCGAGCACAC GGCGTCGGCG
CAGCGCCGCA CCCTCGTCCC GCTGCACTTC AACACCGCCT TCGGTGAGAT CGCGATCTCG
TTCGTCACCG TGGTGGTGAT CGTCGCGGGC GTCCGCTGGT CGACCGCCCA CACCCGGTGG
GAGCCGACGC TGCACCTGTC GGCCGGTGAG CTGGTGGCGA TGCTCCTGCT CGTCACCTTC
TTCGTCCGGC CGTTGCAGAT GCTGGTCCAG ATGCTCGGCG AGGCCCAGAA CGCCGTCGTC
GGCTGGCGCC GCGCGCTGGA GATCCTCATC GCCGCGGGCG AGCACGTCGC CGTCGTCGGC
GAGACCGGCT CGGGCAAGAG CACCTTCGCC CGGCTGCTGA CCAGGCAGAT CGCGCCACGC
CACGGCCGTG TCCTGCTCGG CGGCCTCCCG GCCGGCCAGG TGTCGGACCT GTCCTTCCAG
CGCCGCGTCG CGGTCGTCCC GCAGGACCCG TTCCTGTTCG ACGCCACGAT CGCCGACAAC
ATCCTCGCCG GGGTCCGCGG CGACGCCGGG GCGCTCGACG AGATCGTCGA CTCGCTCGGC
CTGCGGCCGT GGATCGCCAC CCTGCCCGAG GGCCTCGACA CCCGCGTCGG CACACGCGGC
GACCGGCTCT CCGCCGGCGA GCGCCAGCTC GTCGCCCTCG CCCGCACCGC CCTGGTCGAC
CCCGACCTGC TCGTGCTCGA CGAGGCGACC AGCGGCGTCG ACCCCGCGAC CGACGTCCGC
GTGCAGCACG CGCTCGGCGC GCTCACGGTC GGGCGGACCA CGGTCTCGAT CGCCCACCGC
ATGGTGACGG CCGAGCGCGC CGACCGCGTT CTCGTCTTCG ACCACGGGCG CCTCGTCCAG
AGCGGGCGTC ACGACGACCT CGTCCGCGTC CCCGGCCACT ACGCCCGGCT GCACGCCGCC
TGGGTCGAGA ACACCGCCGG CGCCGACGAG CGCCACCAGA CCCTCCTCCA CCACAATGAC
GGGAGCACCC CGTGA
 
Protein sequence
MLEGSLAHPR PALIATLAGI VNGTTMILGA AAIGWATDHL IVPALAGGHV ARATWWIAVG 
AILGVSTVRW MTIVIRGIAT GYVQHGSQAR VRRSVVGRYL ELDLAWHRRH PPGRLLSTAV
SDVDALWFPM VFYYFALGMI VMLVVAIVQL FGHDTALGLV GVGLVGSVLG VNLLYQRLLS
PRARAVQDSR GEFGALALES IEGGQVVRTL GIADRERARV GAAALRLRAA TTAAGDLSSV
FDPLLEVLPT AAVMAVLAVG SRRVETGDLS VGVLVEVVYL LLTISIPLNV ISRFLGMLPV
SAAGRTRVAA VLDAAETTAH GDRALPSGAA PGPRAPAPRA PGVGLVRGGT SLLTDVDIEV
RPGEIVAIVG PTGSGKTTLI ELLSRQVDPT DGVVEIGGVR ATDLARGEIS SQLAVVGQTS
WLFGGSVHAN LQLDGHPRER RPYTAGEIWR ALAAAGADDV VRDLPNGLDT RVGERGARLS
GGQRQRLCLA RALLREPGVL LLDDATSALD RRTEAALAEL RAAGVARIHE MAAETFARTG
SADLVSRLTG DVDAVTTFVQ SGGVMLLVNV TQMIIAGVLI AVYSWQLAVP VLATAVLLFV
AMRRVQALVA RRFTVVRESV SALQSTVGEA VTGISVIRST GTEARSRAMV EDAVEHTASA
QRRTLVPLHF NTAFGEIAIS FVTVVVIVAG VRWSTAHTRW EPTLHLSAGE LVAMLLLVTF
FVRPLQMLVQ MLGEAQNAVV GWRRALEILI AAGEHVAVVG ETGSGKSTFA RLLTRQIAPR
HGRVLLGGLP AGQVSDLSFQ RRVAVVPQDP FLFDATIADN ILAGVRGDAG ALDEIVDSLG
LRPWIATLPE GLDTRVGTRG DRLSAGERQL VALARTALVD PDLLVLDEAT SGVDPATDVR
VQHALGALTV GRTTVSIAHR MVTAERADRV LVFDHGRLVQ SGRHDDLVRV PGHYARLHAA
WVENTAGADE RHQTLLHHND GSTP