Gene Francci3_3392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3392 
Symbol 
ID3905974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4021130 
End bp4022569 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content67% 
IMG OID637880714 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_482475 
Protein GI86742075 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCTC AGCCCAGCAC ACACCCGGGA TGGAACCCCG GCCAGCTACG CCGCCACCGC 
GAACAACTCG GACTGAGCCG GGCCGCCCTC GCCGACAAGA TCGCCAACAT TGACCCTGTC
GTGGCCAAGG AAGCGGGGTT CACCCCACCG GCCGCCGGCC CCGAGATGAT CAGCAAGCAC
GAGCGGGGCG TCAACTTCCC CGGCACGTCC TACCAGGCGG CTTACTGCCA CTTTTTCCAG
GCCAGCGAAC CCGAACTCGG ATTCCGCTAC CCCTACCCCC ACGAACACAA AGACGATCAT
GCGGGGTCTG TTACACCACC GCTAGCCGGG ATACCCTCAC AACACCAGCC CACGCCGCAG
ACCGAGGGGG TGGAGACCGC GAACCGCAGA GACCTACTCA GCCTCACCGC CGCAGCCATA
GGACTCACCG CCACCGGCAC AGCCGCAGCG ATGATCGCGC CCGCCGACAG GTTGGCCATC
CTGGAACGCG CCACCGCCGG CAGCGGCGCG GCATCCGCGG CCGAAGGGGC CCTACAGGCC
GTGGTGGCCG ACTACCTGCA CCATCCCCCC GCCGAGACAT TGCGCCGGGC CATCGCGTTG
CAGCAGCTCA CCGACGCCAT CACCGCCCAA TACCCGCTAC GGCCCGCCGA CCAGGCCCGG
GTCTGGCGCG TATCCGGTGT CGCCACCGGC ATCCGCGGAT GGCTGGAGAA CAACGCCGGA
CACACCACCG ACGCCCGGCT GTCCTTGCGG GAAGCCCACC GCAGAGGCGA ACTCCTCGAA
GACAATCAGT TGATCGCCTG GACGCGCTTC ATGCAGGCCA CCATCGAGGA CTACGCCGGC
AACCCCGCCG GAGCCGAACA GTACGCGCTC GACGGCCTGC GCCACGCACC CAGCGGACCA
CAACGGGCTA TCTTGCTGGT GGGCAGCCTT GCCGGGGCCC GCGCCGCCCA TGGCGACATC
CGAGGAGTTG ACGCGGCCGT CAGCGAGGCC GAAAGCATCG TTAGCAGGCT GACTCCAGAC
GAGCGCGGCC CGCGAGAGGA TCATCGTACC GTCGTTGACA GCCTTACCAG CATCGGATTG
CCGATCTTCT CTCTCGACGT CGGCCGAGTG TATTCCCGCC TTGGCGAGAT AGGCCGCTTC
ATGGAGGTAA CGGCCGACGT CCGACCCGCC ATCGAGCAGG GCAGTAGCTC ACGGACGTCC
GTGTTCCGCG CGGATGAGGC AGTAGCCGTT GCACGGAGCA AGTCACCGGA TCTTGACAGG
GTGGCCGCAC TTGCCCGCGA AAGTTTGCGT CTCGCTGGCC CGTTCCAGAC CGCTCACATC
GCAAGCCGGG TCAACGTCGT CCTCGCCGTG ACTGACCGCG ACTATCCCGC GATTCGATCA
TTGGCCGACG AGGCCCACAC CTGGCGGATC AGCCGCAACA GGCCCATCGA CCTCGTCTAA
 
Protein sequence
MPPQPSTHPG WNPGQLRRHR EQLGLSRAAL ADKIANIDPV VAKEAGFTPP AAGPEMISKH 
ERGVNFPGTS YQAAYCHFFQ ASEPELGFRY PYPHEHKDDH AGSVTPPLAG IPSQHQPTPQ
TEGVETANRR DLLSLTAAAI GLTATGTAAA MIAPADRLAI LERATAGSGA ASAAEGALQA
VVADYLHHPP AETLRRAIAL QQLTDAITAQ YPLRPADQAR VWRVSGVATG IRGWLENNAG
HTTDARLSLR EAHRRGELLE DNQLIAWTRF MQATIEDYAG NPAGAEQYAL DGLRHAPSGP
QRAILLVGSL AGARAAHGDI RGVDAAVSEA ESIVSRLTPD ERGPREDHRT VVDSLTSIGL
PIFSLDVGRV YSRLGEIGRF MEVTADVRPA IEQGSSSRTS VFRADEAVAV ARSKSPDLDR
VAALARESLR LAGPFQTAHI ASRVNVVLAV TDRDYPAIRS LADEAHTWRI SRNRPIDLV