Gene Franean1_2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2204 
Symbol 
ID5670603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2636199 
End bp2637812 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content71% 
IMG OID641241124 
ProductABC transporter related 
Protein accessionYP_001506545 
Protein GI158314037 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.246481 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATCG TGTCCGAGCT CGAGCTCCGT GCCGGAGCCC GCACACTGAT CGAGCCGGTC 
TCGTTCCGGG TGCAGCCGGG TGACCGTATC GGCCTCGTCG GCCGCAACGG CGCCGGCAAG
ACGACGATGC TGAAGACGCT CGCGGGGGAG ACCCTGCCCT TCGCCGGCAA GGTGGACATC
CGCGGCGAGA TCGGCTACCT GCCGCAGGAC CCGCGCACCG GCGACCTCGC CGACACCGCC
CGCGACCGCG TGCTCGCCGC CCGAGGCCTC GACGTGATCC TGCGCGAGAT GGAGAAGCTC
CAGCTCGAGA TGGCCGAGTT CGTCGACGAG ACGGCCCGCG ACACCGCGGT GCGTCGCTAC
GGCCGGCTGG AGGAGCGGTT CGGGATGCTC GGCGGGTACG CCGCCGAGGC GGAGGCGGCG
CGGATCTGCT CCTCACTCGG CCTGCCCGAC CGGGTTCTCG GCCAGCAGAT CGGAACGCTC
TCGGGTGGCC AGCGCCGCCG CGTCGAGCTG GCCCGGATCC TGTTCGCCGG TTCGGGGAAC
TCCGACGCGA CACTGCTGCT CGACGAGCCG ACCAACCACC TCGACGCCGA CTCGATCACC
TGGCTGCGTG ACTTCCTGCG CGCCCATTCC GGCGGCCTGA TCGTCATCAG CCACGACGTC
GACCTGCTGG ACAAGAGTGT GAACAAGGTT TTCCATCTCG ACGCCAACCG CGCCGCGCTC
GACGTCTACA ACGTCAACTG GAAGACCTAT CTCACCCAGC GCGATCAGGA CGAGCGCCGC
CGGCGGCGGG AGCGGGCCAA CGCGGAGAAG AAGATCGACT CGTTGCGGGC GCAGGCCGAC
AAGATGCGGG CGAAGGCCAC CAAGGCGCGC GCGGCACACC AGATGGACCG CCGCGCGGAG
CGCCTCGCCT CCGGCCTCGC CGAGGTCCGG GTCGCCGACC GGGTCGCCAA GCTGCGCTTC
CCCGATCCGG CTCCGTGCGG CCGCACGCCG CTGACCGCCA CCGGCCTGTC GAAGTCCTAC
GGCTCGCTGG AGGTGTTCAC CGGCGTCGAC CTGGCGATCG ACCGCGGCAC CCGGGTCGTG
GTCCTGGGCC TCAACGGCGC CGGCAAGACG ACGCTGCTGC GCATGCTCGC CGGCCAGGAG
ACCCCGGACG CCGGCGAGGT GCACCCGGGG CACGGCCTGC GCCTGGGGTA CTACGCGCAG
GAGCACGAGA CGCTCGACAC CTCCCGCACG GTGCTCGACA ACATGCGCGC CGCGGCGCCC
ACCGCCTCCG ACGTCGACCT GCGTCGCATC CTGGGCGCAT TCCTGTTCGG GGGGGACGCG
GTCGAACAGC TCGCCGAGAC GCTCTCCGGC GGTGAGAAGA CCCGGCTGGC CCTGGCTGGC
CTGGTCTGCA GCTCGGCGAA CGTGCTGCTG CTCGACGAGC CGACGAACAA CCTCGACCCG
GCGTCGCGCG ACGAGGTCCT GAGCGCACTG CGCACCTACC GGGGCTCGGT GGTCCTCGTC
ACGCACGACC CGGGCGCGGT CGAGGCACTG AGCCCCGACA AGGTCCTGAT GCTGCCGGAC
GGCGTCGAGG ACACCTGGTC GCCCGACCTC GCCGATCTCG TCACCCTGGC CTGA
 
Protein sequence
MIIVSELELR AGARTLIEPV SFRVQPGDRI GLVGRNGAGK TTMLKTLAGE TLPFAGKVDI 
RGEIGYLPQD PRTGDLADTA RDRVLAARGL DVILREMEKL QLEMAEFVDE TARDTAVRRY
GRLEERFGML GGYAAEAEAA RICSSLGLPD RVLGQQIGTL SGGQRRRVEL ARILFAGSGN
SDATLLLDEP TNHLDADSIT WLRDFLRAHS GGLIVISHDV DLLDKSVNKV FHLDANRAAL
DVYNVNWKTY LTQRDQDERR RRRERANAEK KIDSLRAQAD KMRAKATKAR AAHQMDRRAE
RLASGLAEVR VADRVAKLRF PDPAPCGRTP LTATGLSKSY GSLEVFTGVD LAIDRGTRVV
VLGLNGAGKT TLLRMLAGQE TPDAGEVHPG HGLRLGYYAQ EHETLDTSRT VLDNMRAAAP
TASDVDLRRI LGAFLFGGDA VEQLAETLSG GEKTRLALAG LVCSSANVLL LDEPTNNLDP
ASRDEVLSAL RTYRGSVVLV THDPGAVEAL SPDKVLMLPD GVEDTWSPDL ADLVTLA