Gene PCC7424_4572 details

Gene Information

Locus tag: PCC7424_4572
Symbol:
ID: 7108322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 5056600
End bp: 5058033
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 43%
IMG OID: 643482789
Product: General substrate transporter
Protein accession: YP_002379802
Protein GI: 218441473
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
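
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive and that the final codon is a stop codon, neither of which the record states explicitly; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5056600
end_bp = 5058033

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the results agree with the listed gene length of 1434 bp and protein length of 477 aa.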


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACAA TTACTCAACC TACATCTTTT GAAGAACGTC TCAATCAGTC TAAGATTACT 
CCCACAATGT GGCTACTGTG GGCTTTATCT GCCGGCTTAA TCGCCCTTGA TGGATTTGAT
TTTTTTATCA TTGGCGTTGC CCTTCCTTTC CTGCAACGAG ATTTTAGTCT GACTGCTGTA
GAAATTGCGG CTGTAGCTGT GGCGGCAGTA TCCGGTTCTT TATTAGGATC TTTGACTCTA
GGCCCTGTTA CCGACAAAAT CGGGCGACAA GTTATGTTAT TAGTAGATGT GGCTATTTTT
GTCATTGCTA CCGCCGGAAC AGCACTGGCT TGGAATGGGG TTTCCTTAAT TATTTTTCGT
TTTTTGGTCG GGATAGGAAT TGGGGCCGAT TATCCCATTA GTGTCTCTTA TATTACTGAA
AATGTCCCTT CTCGGTTACG AGGTCGGATG GTCATTGGGG CATTTACTTT TCAAGCTTTT
GGTGCTTTTT TAGGAGCAAT AACCGGACTT TTTGTGATTC ATATTTTTAA TCTCCTTTAC
CCCGACTCTC CACAACCGGC GATTCAGTAC GCTTGGCGCT GGATGCTTGG GGTAGGATTA
TTATTAGCGA TCGCTGTCGG GATCTTACGG TTAAGTTTTT TACTCGAAAG TCCTCGGTAT
TATATTGCCA GAGGAGAGTA TGAGGAAGCG TCTAAAGCGG CTTCTACCCT ACTGGATGAA
CCCATTAATA TTACTCCCGA AACCGACCCC CCGCAACGTG AACCGAATTT ACCCTATTGG
GCGTTATTTG CGTCTGGATA TCGCCAACGG ACAATTTTAG CCTCAGTGCC TTGGTTTTTA
CAAGATATTG CTACTTATGG TATTGGAATT TTTACTCCTG CGATTATCGG CGTTTTAGCC
TTTGCGAGGG AAGATAATTT TATGGCCAGG GAAATGGCAT CGGCTAAAGG TTCGGCGTTT
GTAGATTTGT TTTTGATTGC CGGTTTTATC ATGGCTGTCA TTTTAATTGA GCCAGTGGGA
CGGATGAAAT TGCAAATTAT CGGCTTTTTG GGAATGGCTA TAGGACTGTT AATTTTGGCG
GCATCTAATA GTTTAGGGGA TGAAACTAAT ATTACCCTGG TTTTTTGTGG TTTTTTGGTG
TTTAATCTCA TGATGAATGC TGGGCCGAAT TCTACTACTT TTCTCTTATC GGGAGAAATC
TTTCCGACTT CAATTCGCGC CAGTGGAGCC GGGTTTGCTG CTGCTTTTGC TAAAGCCGGA
GCGGTTGTGG GGACTTTTGC CTTACCGCTT TTGCAAAACT CTCTCGGAAC GGCTACTTTA
TTGATAGTAC TCTCTTTGCT GTGTGTTTTG GCTGCAATTA TTACTTATTT TTATCGTATT
GAAACGGGTG GGCGATCGCT AGAAGCCGTC GATCAAGTAG AGTTTACTCA ATAG
 
Protein sequence
MTTITQPTSF EERLNQSKIT PTMWLLWALS AGLIALDGFD FFIIGVALPF LQRDFSLTAV 
EIAAVAVAAV SGSLLGSLTL GPVTDKIGRQ VMLLVDVAIF VIATAGTALA WNGVSLIIFR
FLVGIGIGAD YPISVSYITE NVPSRLRGRM VIGAFTFQAF GAFLGAITGL FVIHIFNLLY
PDSPQPAIQY AWRWMLGVGL LLAIAVGILR LSFLLESPRY YIARGEYEEA SKAASTLLDE
PINITPETDP PQREPNLPYW ALFASGYRQR TILASVPWFL QDIATYGIGI FTPAIIGVLA
FAREDNFMAR EMASAKGSAF VDLFLIAGFI MAVILIEPVG RMKLQIIGFL GMAIGLLILA
ASNSLGDETN ITLVFCGFLV FNLMMNAGPN STTFLLSGEI FPTSIRASGA GFAAAFAKAG
AVVGTFALPL LQNSLGTATL LIVLSLLCVL AAIITYFYRI ETGGRSLEAV DQVEFTQ
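
As a rough consistency check between the two sequences, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the pipeline that produced the record: the `translate` helper and the table construction are hypothetical, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence above (both in frame).
first_block = "ATGACAACAA TTACTCAACC TACATCTTTT"
last_block = "GATCAAGTAG AGTTTACTCA ATAG"

print(translate(first_block))
print(translate(last_block))
```

The two translated fragments can be compared by eye with the first ten and last seven residues of the protein sequence.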