Gene Francci3_1670 details


Gene Information

Locus tag: Francci3_1670
Symbol:
ID: 3903057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2004888
End bp: 2006501
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 70%
IMG OID: 637879008
Product: ABC transporter related
Protein accession: YP_480775
Protein GI: 86740375
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
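
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence includes a terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2004888, 2006501)
print(length, protein_length(length))  # prints "1614 537"
```

Both results match the Gene Length and Protein Length fields of the record.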


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCATTG TGTCCGAGCT CGAACTGCGT GCCGGAGCCC GCACTCTGAT CGAGCCCGTA 
TCCTTCCGGG TGCAGCCGAA CGACCGGATC GGTCTCGTCG GGCGCAACGG GGCCGGCAAG
ACAACGTTGC TGAAGGTGCT TGCCCGGGAG GGGCTGCCGT TCGCCGGCAC CGTCGACATC
CGGGGTGAGA TGGGTTACCT CCCCCAGGAC CCGCGTACCG GCGATCTGGC CGACACCGCC
CGTGATCGGG TGCTGTCGGC GCGCGGTCTC GACGTTCTCC TGCGCGAGAT GGAGAAGCTG
CAGCTGGAGA TGGCCGAGCT GGTCGACGCC ACGGCGCGGG ACGCGGCGAT CCGCCGCTAC
GGCAGCCTGG AGGAGCGATT CGGGACGCTC GGCGGTTATG CCGCCGAGGC GGAGGCTGCC
CGGATCTGCT CGTCGCTGGG CCTGGCCGAT CGGGTGCTCG CCCAGCCCAT CGGCACCCTG
TCCGGTGGGC AGCGGCGGCG GGTCGAACTC GCCCGCATCC TGTTCGCCGG GTCCGGCAAC
GCCGACGCGA CCCTGCTGCT CGACGAGCCG ACGAACCACC TCGACGCCGA CTCGATCGGG
TGGCTGCGGG ACTTCCTGCG CGCCCACACG GGCGGCCTGA TCGTGGTGAG CCACGACGTC
GACCTGCTCG ACAAGTGCGT GAACAAGGTC TTCCACCTCG ACGCCAACCG GGCCACCCTC
GACGTCTACA ACGTGAACTG GAAGACCTAC CTCAACCAGC GCGAGCTGGA CGAGCGCCGC
CGTCGCCGGG AACGCGCCAA TGCCGAGAAG AAGATCGACT CGCTGAAGGC GCAGGCCGAC
AAGATGCGCG CCAAGGCGAC CAAGGCGCGG GCCGCCCATC AGATGGACCG GCGGGCCGAG
CGGCTCGCGG CCGGGCTCGC CGAGGCGCGC GTCGCGGACC GGGTGGCCAA GCTGCGCTTC
CCGGACCCGG CTCCGTGCGG CCGCACGCCG CTGACCGCCA CCGGACTGTC GAAGTCCTAC
GGCTCGCTGG AGGTGTTCAC CGGTGTGGAC CTCGCGATCG ACCGCGGCTC GCGGGTCGTG
GTCCTCGGGC TGAACGGTGC CGGCAAGACC ACGCTGCTGC GCATCCTGGC GGGCCAGGAG
GCACCCGACG TCGGCCAGGT GCATCCCGGT CACGGTCTAC GTCTCGGGTA CTACGCGCAG
GAACACGAGA CGCTGGACAC CAGCCGCTCG GTGCTGGACA ACATGCGGGC CGCCGCGCCG
GGGACGTCGG ACGTGGAGCT GCGTCGCATC CTGGGCGCGT TCCTGTTCAG CGGAGACAGC
GTGGAGCAGC GCGCCGAGAC CCTGTCCGGC GGCGAGAAGA CCCGCCTCGC GCTCGCCGGA
CTGGTCTGCA GCTCCGCCAA CGTCCTCCTG CTCGATGAGC CGACGAACAA CCTCGACCCC
GCCTCGCGCG ACGAGGTGCT TGCCGCGCTC GCCACGTACC GGGGCTCGGT CGTCCTAGTC
ACCCACGACC CGGGGGCCGT CGAGGCACTC GACCCGCAGA AGGTCCTGAT GCTGCCGGAC
GGGGTCGAGG ACAACTGGTC ACCCGATCTG GCCGAGCTCA TCACCCTGGC CTGA
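
The GC content reported in the gene information can in principle be recomputed from a sequence block formatted like the one above (groups of ten bases separated by spaces and line breaks). The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper, and the example input is only the first two groups of the gene sequence, not the full gene.

```python
def gc_content(seq_block: str) -> float:
    """Fraction of G/C bases in a sequence that may contain spaces and newlines."""
    # Remove all whitespace used for layout and normalise case.
    bases = "".join(seq_block.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two ten-base groups of the gene sequence, for illustration only.
example = "ATGATCATTG TGTCCGAGCT"
print(f"{gc_content(example):.0%}")
```

Passing the full gene sequence block to the same function would give the gene-wide value for comparison with the reported figure.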
 
Protein sequence
MIIVSELELR AGARTLIEPV SFRVQPNDRI GLVGRNGAGK TTLLKVLARE GLPFAGTVDI 
RGEMGYLPQD PRTGDLADTA RDRVLSARGL DVLLREMEKL QLEMAELVDA TARDAAIRRY
GSLEERFGTL GGYAAEAEAA RICSSLGLAD RVLAQPIGTL SGGQRRRVEL ARILFAGSGN
ADATLLLDEP TNHLDADSIG WLRDFLRAHT GGLIVVSHDV DLLDKCVNKV FHLDANRATL
DVYNVNWKTY LNQRELDERR RRRERANAEK KIDSLKAQAD KMRAKATKAR AAHQMDRRAE
RLAAGLAEAR VADRVAKLRF PDPAPCGRTP LTATGLSKSY GSLEVFTGVD LAIDRGSRVV
VLGLNGAGKT TLLRILAGQE APDVGQVHPG HGLRLGYYAQ EHETLDTSRS VLDNMRAAAP
GTSDVELRRI LGAFLFSGDS VEQRAETLSG GEKTRLALAG LVCSSANVLL LDEPTNNLDP
ASRDEVLAAL ATYRGSVVLV THDPGAVEAL DPQKVLMLPD GVEDNWSPDL AELITLA